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A polynucleotide functionally coding for the LHP protein from 
Mycobacterium tuberculosis, its biologically active derivative fragments, as well as 

methods using the same. 

5 

I. Background of the invention 

1) Field of the invention. 

10 The present invention is directed to a polynucleotide comprising an open 

reading frame coding for an polypeptide from Mycobacterium tuberculosis, named 
LHP capable of inducing an immune response in a host, said LHP is placed under the 
control of its own regulation signals which are functional in mycobacteria, specially in 
mycobacteria belonging to the Mycobacterium tuberculosis complex and also in fast 

f£ growing mycobacteria such as Mycobacterium smegmatis and also in E. coli. The 
Mycobacterium tuberculosis complex has its usual meaning, i. e. the complex of 
mycobacteria causing tuberculosis which are Mycobacterium tuberculosis, 
Mycobacterium bovis, Mycobacterium africanum, Mycobacterium microti and the 
vaccine strain M. bovis BCG. 

2$ The invention i^also directed to the polypeptide LHP encoded by Ihp and most 

preferably to suitable antigenic portions of LHP as well as to oligomeric polypeptides 
containing more than one unit of LHP or an antigenic portion of LHP. The invention 
concerns also immunogenic and vaccine compositions containing a polypeptide or an 
oligomeric polypeptide such as defined above or live recombinant attenuated 

25 mycobacteria transformed with a polynucleotide according to the present invention. 
The invention also concerns antibodies directed specifically against such polypeptides 
that are useful as diagnostic reagents. In another embodiment, the present invention is 
directed to a polynucleotide carrying the natural regulation signals of Ihp which is 
useful in order to express heterologous proteins in mycobacteria as well as functionally 

30 active regulatory polynucleotides derived from said regulatory region. Finally, the 
present invention is directed to oligonucleotides comprising at least 12 consecutive 
nucleotides which are useful as reagents for detecting the presence of Mycobacterium 
tuberculosis in a biological sample. 



2) Related Prior art. 

Mycobacterium tuberculosis andM bovis cause tuberculosis, a disease which 
currently kills three million people each year. The virulence of pathogenic 
mycobacteria is associated with their ability to parasitize and survive within 
phagocytic cells. Little is known about mechanisms governing gene expression during 
the intracellular growth stage. This issue is of prime importance as the intracellular 
stage of pathogenic mycobacteria can be viewed as an adaptative process, involving 
transcriptional regulatory mechanisms. Mycobacterial genes affecting intracellular 
growth and virulence are beeing actively sought (Collins, 1996; Collins, 1995, Quixm, 
1996). Using subtractive genomic hybridization between virulent M. bovis and the 
attenuated vaccine strain M bovis BCG, Maheiras et al (Maheiras et al., 1996) 
identified three regions of difference (RD1 to RD3). RD1 was detected in all strains of 
M. tuberculosis andM bovis tested but is absent in all BCG substrains, suggesting that 
it may be an important determinant of virulence. 

The orflC gene, encoding the early secreted antigenic target 6kDa (ESAT-6) 
lies within RD1. The ESAT-6 protein is a major T-cell antigen which has been purified 
from M. tuberculosis short-term culture filtrates (Harboe et al., 1996; Sorensen et al., 
1995). Purified ESAT-6 stimulates the production of gamma interferon from mice 
memory immune T lymphocytes and may contribute to the development of 
antituberculous immunity (Andersen et al., 1995; and US Patent Application filed on 
June 5, 1995). 

The Mycobacterium genus encompasses more than 70 recognized bacterial 
species including M. tuberculosis andM leprae, the agents of tuberculosis and leprosy 
respectively. The development of effective prophylactic vaccine and specific 
diagnostic reagents is a priority to control the extension of mycobacterial infections. In 
that context, mycobacterial protein antigens are extensively screened upon their ability 
to induce B- and T-cell reactivity. Obtention of purified proteins from slow growing 
pathogenic mycobacteria is labor-intensive and requires important containement 
facilities. Alternatively, many immunological studies of mycobacterial antigens have 
been conducted with E. co/z-expressed recombinant molecules. However, problems 
related to lipopolysaccharide (LPS) contamination are frequently encountered. 
Moreover, post-translational modifications such as proteolytic processing, intern 
removal, lipid acylation and glycosylation of proteins have been reported to occur in 
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tuberculosis 19kDa lipoprotein antigen against proteolysis (Hermann). Accordingly, 
there is a great need in the art of suitable protein expression systems allowing the 
preparation of mycobacterial immunogenic polypeptides that are useful for diagnostic 
and vaccine purposes. 

5 

II. Summary of the invention. 

Now, the inventors have discovered a polynucleotide carrying the regulatory 
expression signals of the ESAT-6 protein as well as an open reading frame coding for a 
new antigenic protein from Mycobacterium tuberculosis that they have named LHP. 

10 The LHP polypeptide of the invention share a great similarity with a 

Mycobacterium tuberculosis peptide described in the PCT Application N° WO 
97/09429 or in the PCT Application N° WO 97/09428 (Corixa Corporation) a partial 
sequence of which is disclosed in those Patent Applications. 

P The present inventors have characterized the portions of the polynucleotide 

l| according to the invention that are functional in mycobacteria in order to allow the 
expression of LHP, as well as the expression of an heterologous polypeptide that is 
placed under the control of said regulatory region contained in the polynucleotide 
according to the present invention. 

More specifically, the inventors have located the transcription initiation sites of 

2G the Ihp/orflC operon using M, tuberculosis RNA and have precisely mapped the 
portions of the regulatory region of the Ihp/orflC operon that are functional in bacteria 
in general, being functionally active in E. coli as well as in mycobacteria. Further, the 
inventors have mapped the portions of the polynucleotide according to the present 
invention that are functionally active in slow growing mycobacteria, such as bacteria 

25 belonging to the Mycobacterium tuberculosis complex, and in fast-growing 
mycobacteria, such as M. smegmatis. 

Further, the present inventors have used the functionally active portions of the 
regulatory region of the Ihp/orflC operon for expressing a polypeptide heterologous 
with respect to said regulatory region. 

30 In a specific embodiment, the present inventors have constructed a 

mycobacterial expression vector allowing production of recombinant proteins tagged 
by a stretch of six histidine. Such vector enables production of virtually any polypetide 
o mv,^v>nrtpn^l mntpYt sinH ^"ows easv purification of native proteins by 



proteins for which no specific immune reagent are available. This system is very useful 
for biochemical and immunological characterization of mycobacterial proteins. 

Accordingly, given its high level and constitutive expression of the regulatory 
polynucleotide according to the present invention in mycobacteria, said promoter is 
used to construct a novel mycobacterial expression /purification system. 

This vector designated pIPX30, allows versatile gene fusions to produce 
hisbdine-tagged proteins in mycobacteria. Additionally, the high affinity of poly- 
histidine for immobilized metal ions enables one-step chromatographic isolation of 
native, histidine-tagged polypeptides. As a validation of the system, the inventors have 
performed the expression of recombinant DES(ffistidine) 6 M. tuberculosis protein 
antigen and its immunodetection from M. smegmatis cultures. 

Thus, the present invention is directed to a polynucleotide comprising a 
functional portion of the regulatory region of the /hp operon and to its use in a 
recombinant expression vector carrying a polynucleotide encoding a polypeptide of 
interest. 

The invention also concerns recombinant expression vectors containing a 
polynucleotide according to the invention, and more specifically a polynucleotide 
carrying one of the regulatory polynucleotides characterized by the inventors. 

The invention is also directed to recombinant cell host containing a 
polynucleotide or a recombinant vector as defined above. 

In another embodiment, are also part of the present invention the entire LHP 
antigenic polypeptide as well as particular antigenic portions of the LHP polypeptide 
that have been identified by the inventors 

A further embodiment of the present invention is directed to oligomeric 
polypeptides that contain at least one unit of an antigenic portion of the LHP 
polypeptide, that are useful as immunogenic molecules. Consequently, the present 
invention concerns also immunogenic compositions as well as vaccine compositions 
that are useful to diagnose and to prevent an infection by mycobacteria belonging to 
the M. tuberculosis complex, and more specifically by Mycobacterium tuberculosis in 
humans and animals. 

Another object of the present invention consists in a polyclonal or a monoclonal 
antibody directed specifically against the LHP polypeptide or an antigenic portion 
thereof. 
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to perform the diagnosis of an infection with Mycobacterium tuberculosis in a 
biological sample. 

Finally, the invention pertains to immunogenic and vaccine compositions 
containing at least a polypeptide or a recombinant cell host expressing the LHP 
5 polypeptide, preferably in combination with the EAST-6 antigenic protein and also to 
vaccine compositions containing live non pathogenic recombinant cell hosts expressing 
these polypeptides. 

III. Brief description of the Figures : 
10 Figure 1 : Main features of the nucleotide insert contained in plasmid pIPX26 that has 

been deposited at the CNCM on May 14, 1996 under the accession number 1-1706. 

This insert contains the whole polynucleotide carrying the lhp-orf\C operon. 

pIPX26 is a shuttle cloning vector (E. co/Z-mycobacteria) of the pPV24 kind confering 
^ kanamycin resistance and carrying a DNA insert at the unique cloning sites Kpnl (Asp 
t| 718) and BamHI). This DNA insert is a 1282 bp DNA fragment form Mycobacterium 

tuberculosis H37Rv, which has been generated by PCR amplification using the 

following pair of primers : 
l{ ESB-1 (S'-GGGGGGATCCGGTACCAGGTGACGTCGTTGTTCAGCCAG-S') 

and ESB-2 (5 '-GGGGGGTACCGGATCCTCGTAGTCGGCCGCCATGACAAC-3 '), 
20; and by digestion with the restriction enzymes Asp718 and BamHI. This DNA fragment 

carries the open reading frame referenced orflC (which is the Ihp gene), the esat-6 (also 

referred to as orflc) gene and its own transcription terminator. This DNA fragment 

comprises also a promoter activity and transcription start sites allowing gene 

expression, including Ihp and esat-6 {orflC) inM smegmatis andM bovis-BCG. 
25 When plasmid pIPX26 is transferred in M. smegmatisl and M. bovis-BCG, the 

ESAT-6 protein, which is normally absent from these mycobacterial strains is 

expressed. This ESAT-6 heterologous expression is detected by Western blot with the 

monoclonal antibody Hyb 76-8 on protein extracts. 

The sequence of the DNA insert of pIPX 26 is presented in Annex II. 

30 

Figure 2 : Gene arrangement upstream from the M tuberculosis orflC gene and lacZ 
gene fusions used in this study. The l.lkb Pstl fragment from pAA249 was blunted 



pairs 0F1 (5-GGGGGGATCCCAGGTGACGTCGT TGTTCAGC-3') and OBI (5'- 
GGGGGGTACCACGGTGACGTCGTTGTTCAGC-3'), OF1 and OB2 (5'- 
GGGGGGTACCAACGGTGACGTCGTTGTTCAGC-3') together with PE-1 (5'- 
GGGGGGTACCGGGTGGCCGGGAAGTCTGTTG-3') and PE-4 (5'- 
GGGGGGATCCCTGCAGCAGGTGACGTCGTTG-3') were used for PCR 
amplification from pIPX61. Plasmids pIPX45, pIPX46 and pIPX18 were obtained by 
insertion of BamRVAspl 18-digested PCR fragments into the corresponding sites in 
pJEM13 and pJEM15. Stem/loops represent probable transcription terminators and 
open triangles indicate 18bp tandem repeats upstream from Ihp. Results of §- 
galactosidase assays are means and standard deviations of three measurements and 
were determined in M. smegmatis as described in (Timm et al., 1994). 

Figure 3 : Map of plasmid pBluescript 11 KS (+/-) phagemid used to construct 
plasmid pIPX61. 

Figure 4 : Main features of the nucleotide insert contained in plasmid pIPX61 that has 
been deposited at the CNCM on May 14, 1996 under the accession number 1-1705. 

Figure 5 : Nucleotide and amino acids sequence features upstream from the M 
tuberculosis orflC start codon. (A) Nucleotide sequence and deduced amino acid 
sequence of Ihp. A potential ribosome binding site (RBS) upstream from the predicted 
start codon is underlined. Transcriptional start sites identified in M. tuberculosis (+1 
Mtb) and in M. smegmatis (+1 Ms) are indicated by triangles. (B) Peptide sequence 
similarity between the predicted M tuberculosis Ihp gene product and the M. leprae 
L45 seroreactive protein antigen (accession number X90946). 

Figure 6 : Mapping of the Ihp-orflC promoter activity. (A) Primer extension mapping 
of the transcriptional start sites (Tl, T2 and T3) in M. tuberculosis. Reverse 
transcription was performed as described in (Berthet et al., 1995) using the E64 
oligonucleotide (5'-CCCTGCAACGAACCTGCCGTCGACTCCACC-3') with (lane 1) 
or without (lane 2) RNA. The DNA ladder was generated by sequencing pIPX61 with 
E64 using the T7 sequencing kit (Pharmacia Biotech). (B) structural features of the M. 
tuberculosis orflC promoter. (C) Primer extension mapping of the transcriptional start 
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Figure 7 : Analysis of the Ihp/orflC messenger RNA transcript. Total RNA was 
extracted from M. tuberculosis broth cultures on day 5 (lane 1 and 2), day 9 (lane 3), 
day 13 (lane 4) and day 16 (lane 5). Total RNA (5^g) was separated on 1% agarose gel 
supplemented with formamide/formaldehyde and processed for Northern blotting as 
5 described in (Sambrook et al., 1989). Hybridization was carried out using the 
radiolabeled ESA-A probe (See Figure 2). Autoradiography was performed for 4 (lane 
1) to 24 hours (lane 2 to 5). 

Figure 8 : Features of the pIPX30 expression/tagging plasmid. Plasmid pIPX 30 is 
10 derived from plasmid pPV24 and is a shuttle plasmid possessing the following features 

1) The origin of replication of pAL5000 for propagation in mycobacteria, the origin of 
replication from vector pUC19 allowing its propagation in E. coli 9 the aph selection 
J gene confering resistance to kanamycin; 

% 2 ) th e promoter region of I hp and ESAT-6 from M. tuberculosis, functionally active in 
{ i\ slow growing (M tuberculosis, M bovis-BCG, etc.) and in fast growing mycobacteria 

l:;- 1 (M smegmatis). 

^ 3) an expression cassette consisting in : Shine-Dalgarno site/ATG from plasmid 
pJEM15, three cloning -sites (BamHI, Kpnl, PstI), a DNA fragment coding for six 
2tM Histidine, two translation stop codons and the transcription terminator form ESAT-6. 

Plasmid pIPX30 has been constructed by digestion of plasmid pPV24 with 
KpnI/PstI, then treated by phage T4 DNA polymerase and then by insertion of an 
expression cassette having blunt-ended at 5' and 3' ends. 
25 Plasmid pIPX30 allows the production of proteins having a six Histidine stretch 

on their NH2 extremity. This feature facilitates their purification by affinity 
chromatography on columns endowed with immobilized metal ions (IMAC). 

Figure 9 : Beta-galactosidase activities of M smegmatis clones containing pIPX34 or 
30 positive (pJN30) and negative (pJEM13) control vectors. 

Figure 10 : Immunodetection of DES-(His)6 in Mycobacterium smegmatis protein 
extracts. Lanes 1-2 : revelation with an anti-DES polyclonal antiserum. Lanes 3-4 : 
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Figure 11 : Map of plasmid pPV24. pPV24 is a shutUe plasmid (E.coli - 
mycobacteria). This plasmid has been constructed in two steps : 

a) A large portion of the ampiciline resitance coding gene as well as the neigbouring 
non-useful sequences of plasmid P UC18 (Ndel + Bsal fragment) have been replaced 
by the kanamycm resistance gene from pUC4K (PstI fragment) which also express in 
mycobacteria. The resulting vector is pPV8 (2.8kb). 

b) The minimal origin of replication of the mycobacterial plasmid pAL500 (EcoRV + 
Hpal fragment) has been cloned at the StuI site from pPV8. The final vector is pPV24 
(5.4 kb), which carries the multiple cloning site from pUC18 and allows the direct 
detection of recombinant host cells on culture medium supplemented with X-Gal. 

Figure 12 : pPXl is a shuttle cloning vector (E. coli - mycobacteria) of the pPV24 
kind, which confers kanamycin resistance and possessing a 855 bp insert at the BamHI 
unique cloning site. The 855 bp insert from Mycobacterium tuberculosis H37 Rv is 
generated by PCR amplification using the following primer pair : 
ESB-1 (5 '-GGGGGGATCCGGTACCAGGTGACGTCGTTGTTCAGCCAG-3 ') 
PCM (5'- 

GGGGGGATCCTCAATGGTGATGGTGATGGTGGAAGCCCATTTGCGAGGACA 
GCGC-3'), 

and then by digestion with the restriction enzyme BamHI. This DNA fragment 
contains the open reading frame referenced OrfiC (which is the Ihp gene) fused to a 
DNA stretch coding for six Histidine. This DNA fragment carries a promoter region 
and transcription start sites, allowing gene expression in Mycobacterium smegmatis 
and Mycobacterium bovis-BCG and Mycobacterium tuberculosis. 

Figure 13 : Predictive analysis of the conformational features and of the 
hydophilicity/hydrophobicity pattern of the LHP polypeptide (Kyte an Doolitlle's 
hydropathy [1982] and Goldman et al. transbilayer helices [1986]). Each point 
represent one ammo acid of the LHP sequence. Long vertical stretches represent alpha 
helix conformation and short vertical stretches represent beta-turns. Circles represent 
the hydrophilicity index of a particular amino acid, taking into account of the 
hydrophilicity/hydrophobicity of the neighbouring amino acids. 
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IV. Detailed Description of the preferred embodiments 

The present inventors have discovered a new polynucleotide and have shown 
that said polynucleotide contained a whole operon consisting in a regulatory region 

5 containing a functional promoter and a functional ribosome binding site that drives the 
expression of two structural genes respectively encoding a new polypeptide named 
LHP and an already known polypeptide named ESAT-6. 

Further, the inventors have discovered that the two structural genes are co- 
transcribed under the control of the said promoter region. 

10 The inventors have further characterized the LHP polypeptide as being a 

polypeptide produced and excreted by Mycobacterium tuberculosis. The inventors 
have also demonstrated that the polypeptide LHP was produced simultaneously with 
the antigenic polypeptide ESAT-6 in Mycobacterium tuberculosis. As shown herein by 

W the inventors, via a micro sequencing method of the peptides excreted in the culture 
medium supernatant of Mycobacterium tuberculosis, the LHP polypeptide is secreted 
by said pathogenic bacterium. 

Moreover, the present inventors have shown that the regulatory region located 

^ at the 5' end of the open reading frame coding for LHP can be successfully used to 
drive the expression of an heterologous polynucleotide as regards to LHP in a 

20 recombinant cell host. 

For this purpose, the inventors have designed three plasmids containing the 
regulatory region of Ihp and esat-6 (orflC), namely plasmids pIPX30, pIPX26 and 
pPXl. 

pIPX30 has been deposited at the CNCM (Collection de cultures de 
25 microorganismes) on February 13, 1997 under the accession number 1-1845. A map of 
plasmid pIPX30 is shown on Figure 8. 

pIPX26 has been deposited at the CNCM on May 14, 1996 under the accession 
number 1-1706. A map of pIPX26 is shown on Figure 1. pIPX26 has been designed 
using a specific plasmid construct named pPV24 (See Figure 11), that has been 
30 deposited at the CNCM on May 14, 1996 under the accesion number 1-1704, which is 
part of the invention. 

pPXl contains the regulatory region of Ihp/orflC, the open reading frame 
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shown on Figure 12. A restriction map of the 855 bp insert of plasmid pPXl is shown 
in Annex III of the instant specification. 

Taking into account that neither the regulatory region sequence nor the LHP 
encoding nucleotide sequence were found to have a strong homology with already 
known nucleotide sequences and then taking into account of their uniqueness in 
mycobacteria, a further object of the present invention consists in polynucleotides 
derived from the polynucleotide containing the Ihp/orflC operon, or alternatively a 
polynucleotide hybridizing under stringent hybridization conditions with the 
polynucleotide containing the Ihp/orflC operon, which are useful as primers or probes 
in order to detect specifically a bacterium of the Mycobacterium tuberculosis species 
in a biological sample. 

Thus, the present invention is directed to a purified polynucleotide wherein said 
polynucleotide is choosen from the group consisting of : 

a) a polynucleotide comprising the following nucleotide sequence of SEQ ID N°l : 

CTGCAGCAGGTGACGTCGTTGTTCAGCCAGGTGGGCGGCACCGGCGGCGGC 
AACCCAGCCGACGAGGAAGCCGCGCAGATG 

GGCCTGCTCGGCACCAGTCCGCTGTCGAACCATCCGCTGGCTGGTGGATCA 
GGCCCCAGCGCGGGCGCGGGCCTGCTGCG 

CGCGGAGTCGCTACCTGGCGCAGGTGGGTCGTTGACCCGCACGCCGCTGAT 
GTCTCAGCTGATCGAAAAGCCGGTTGCCC 

CCTCGGTGATGCCGGCGGCTGTTGCCGGATCGTCGGTGACGGGTGGCGCCG 
CTCCGGTGGGTCCGGGAGCGATGGGCCAG 

GGTTCGCAATCCGGCGGCTCCACCAGCCCGGGTCTGGTCGCGCCGGCACCG 
CTCGCGCAGGAGCGTGAAGAAGACGACGA 

GGACGACTGGGACGAAGAGGACGACTGGTGAGCTCCCGTAATGACAACAG 
ACTTCCCGGCCACCCGGGCCGGAAGACTTG 

CCAACATTTTGGCGAGGAAGGTAAAGAGAGAAAGTAGTCCAGCATGGCAG 
AGATGAAGACCGATGCCGCTACCCTCGGGC 

AGGAGGCAGGTAATTTCGAGCGGATCTCCGGCGACCTGAAAACCCAGATCG 
ACCAGGTGGAGTCGACGGCAGGTTCGTTG 

CAGGGCCAGTGGCGCGGCGCGGCGGGGACGGCCGCCCAGGCCGCGGTGGT 
GCGCTTCCAAGAAGCAGCCAATAAGCAGAA 
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TCGCGGGTATCGAGGCCGCGGCAAGCGCAATCCAGGGAAATGTCACGTCCA 
TTCATTCCCTCCTTGACGAGGGGAAGCAG 

TCCCTGACCAAGCTCGCAGCGGCCTGGGGCGGTAGCGGTTCGGAGGCGTAC 
CAGGGTGTCCAGCAAAAATGGGACGCCAC 
5 GGCTACCGAGCTGAACAACGCGCTGCAGAACCTGGCGCGGACGATCAGCG 
AAGCCGGTCAGGCAATGGCTTCGACCGAAG 

GCAACGTCACTGGGATGTTCGCATAGGGCAACGCCGAGTTCGCGTAGAATA 
GCGAAACACGGGATCGGGCGAGTTCGACC 

TTCCGTCGGTCTCGCCCTTTCTCGTGTTTATACGTTTGAGCGCACTCTGAGA 
10 GGTTGTCATGGCGGCCGACTACGA 

b) a polynucleotide comprising the following nucleotide sequence of SEQ ID N°2, 
starting at its 5 'end with the nucleotide in position 1 of SEQ ED N°l and ending at its 
3' end with the nucleotide in position 524 of SEQ ED N°l, or a biologically active 
polynucleotide derivative of SEQ ID N°2 : 

3 CTGCAGCAGGTGACGTCGTTGTTCAGCCAGGTGGGCGGCACCGGCGGCGGC 
J AACCCAGCCGACGAGGAAGCCGCGCAGATG 

GGCCTGCTCGGCACCAGTCCGCTGTCGAACCATCCGCTGGCTGGTGGATCA 

GGCCCCAGCGCGGGCGCGGGCCTGCTGCG 

CGCGGAGTCGCTAGCTGGCGCAGGTGGGTCGTTGACCCGCACGCCGCTGAT 
20; GTCTCAGCTGATCGAAAAGCCGGTTGCCC 

CCTCGGTGATGCCGGCGGCTGTTGCCGGATCGTCGGTGACGGGTGGCGCCG 
CTCCGGTGGGTCCGGGAGCGATGGGCCAG 

GGTTCGC AATCCGGCGGCTCC ACC AGCCCGGGTCTGGTCGCGCCGGC ACCG 
CTCGCGCAGGAGCGTGAAGAAGACGACGA 
25 GGACGACTGGGACGAAGAGGACGACTGGTGAGCTCCCGTAATGACAACAG 
ACTTCCCGGCCACCCGGGCCGGAAGACTTG 

CCAACATTTTGGCGAGGAAGGTAAAGAGAGAAAGTAGTCCAGC 

c) a polynucleotide comprising the following nucleotide sequence of SEQ ID N°3, 
starting at its 5' end with the nucleotide in position 1 of SEQ ID N°l and ending at its 

30 3' end with the nucleotide in position 481 of SEQ ID N°l, or a biologically active 
polynucleotide derivative of SEQ ID N°3 : 

CTGCAGCAGGTGACGTCGTTGTTCAGCCAGGTGGGCGGCACCGGCGGCGGC 
AACCCAGCCGACGAGGAAGCCGCGCAGATG 
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CGCGGAGTCGCTACCTGGCGCAGGTGGGTCGTTGACCCGCACGCCGCTGAT 
GTCTCAGCTGATCGAAAAGCCGGTTGCCC 

CCTCGGTGATGCCGGCGGCTGTTGCCGGATCGTCGGTGACGGGTGGCGCCG 
CTCCGGTGGGTCCGGGAGCGATGGGCCAG 

GGTTCGCAATCCGGCGGCTCCACCAGCCCGGGTCTGGTCGCGCCGGCACCG 
CTCGCGCAGGAGCGTGAAGAAGACGACGA 

GGACGACTGGGACGAAGAGGACGACTGGTGAGCTCCCGTAATGACAACAG 
ACTTCCCGGCCACCCGGGCCGGAAGACTTG 

d) a polynucleotide comprising the following nucleotide sequence of SEQ ID N°4, 
starting at its 5' end with the nucleotide in position 525 of SEQ ID N°l and ending at 
its 3' end with the nucleotide in position 826 of SEQ ID N°l coding for the LHP 
polypeptide : 

ATGGCAGAGATGAAGACCGATGCCGCTACCCTCGGGC 

AGGAGGCAGGTAATTTCGAGCGGATCTCCGGCGACCTGAAAACCCAGATCG 
ACCAGGTGGAGTCGACGGCAGGTTCGTTG 

CAGGGCCAGTGGCGCGGCGCGGCGGGGACGGCCGCCCAGGCCGCGGTGGT 
GCGCTTCCAAGAAGCAGCCAATAAGCAGAA 

GCAGGAACTCGACGAGATCTCGACGAATATTCGTCAGGCCGGCGTCCAATA 

CTCGAGGGCCGACGAGGAGCAGCAGCAGG 

CGCTGTCCTCGCAAATGGGCTTCTG 

d) a polynucleotide comprising at least 12 consecutive nucleotides of a polynucleotide 
chosen among the group consisting of SEQ ID N°2, SEQ ID N°3 or SEQ ID N°4; 

e) a polynucleotide having a sequence fully complementary to a polynucleotide 
choosen among the group consisting of SEQ ID N°2, SEQ ID N°3 or SEQ ID N°4; 

f) a polynucleotide hybridizing under stringent hybridization conditions with a 
polynucleotide choosen among the group consisting of SEQ ID N°2, SEQ ID N°3 or 
SEQ ID N°4. 

By a biologically active polynucleotide derivative of SEQ ID N°2 or SEQ ID 
N°3 according to the present invention is meant a polynucleotide comprising or 
alternatively consisting in a fragment of said polynucleotide which is functional as a 
regulatory region for expressing a recombinant polypeptide in a recombinant cell host. 

More specifically, a typical biologically active polynucleotide derivative of 
SEQ ID N°2 or SEQ ID N°3 is a polynucleotide comprising at least the nucleotide 
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In a particular embodiment of a biologically active derivative of SEQ ID N°2 or 
SEQ ID N°3 the ribosome binding site (Shine Dalgarno sequence) which is located 
from the nucleotide at position 508 to the nucleotide at position 512 of SEQ ID N°l 
may be removed or absent and optionally replaced by a suitable natural or synthetic 
ribosome binding site, depending on the recombinant cell host in which its expression 
is desired. 

As shown by the inventors, LHP is produced in short term culture filtrates of 
Mycobacterium tuberculosis, thus in the same time as ESAT-6. It is greatly expected 
that LHP and ESAT-6 have a synergystic action in inducing a protective immune 
response against a pathogenic mycobacterium, specifically mycobacteria belonging to 
the tuberculosis-complex. Thus, it is a preferred embodiment of the present invention 
to obtain a composition containing simultaneously LHP and ESAT-6, optionally in 
combination with other antigenic proteins from Mycobacterium tuberculosis, such as, 
for example, the 45/47 kDa protein or the 19 kDa, DES, ERP (28Kd) or any protein 
identified by biochemical or genetic means. Such a composition containing both at 
least LHP and ESAT-6 may be under the form of a polypeptide composition or under 
the form of a composition of live recombinant cell host expressing both proteins or an 
admixture of recombiant cell hosts each expressing one protein choosen among LHP or 
ESAT-6, the whole compositions being useful for ixnmunodiagnostics or vaccine 
purposes. 

In a specific embodiment of a recombinant vector according to the present 
invention, such a recombinant vector contains a regulatory polynucleotide of the 
invention which is placed in the suitable frame with regards to a polynucleotide 
containing two open reading frames encoding respectively LHP and ESAT-6. Such a 
plasmid may be, for example, pIPX26 that has been deposited at the CNCM under the 
accession number 1-1706 (See Figure land Annex II). Another suitable recombinant 
plasmid, is plasmid pPXl that is contained in the E. coli strain that has been deposited 
at the CNCM on May 14, 1996 under the accession number 1-1707 (See Figure 12 and 
Annex III). 

In order, to identify the relevant biologically active polynucleotide derivatives 
of the invention that are described hereinbefore, the one skill in the art will refer to the 
Examples 5 and 6 of the instant specification in order to use a reombinant vector 
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one skill in the art being guided by the restiction maps presented in Annexes I to III. 
Annex I represents the 1069 bp nucleotide sequence of the PstI DNA insert contained 
in plasmid pIPX61 (see Figure 3). Annex II represents the 1282 bp nucleotide 
sequence of the Kpnl BamHl DNA insert contained in plasmid pIPX26 (See Figure 1). 
Annex III represents the 855 bp nucleotide sequence of the DNA insert of plasmid 
pPXl. 

Said regulatory polynucleotides may also be prepared by digestion of any of 
SEQ ID N°l, SEQ ID N°2 or SEQ ID N°3 by an exonuclease enzyme, such as for 
example BaB 1 (Wabiko et al., 1986). 

Another object of the present invention is a recombinant vector containing a 
polynucleotide of SEQ ID N°2 or SEQ ID N°3, or a biologically polynucleotide 
derivative thereof, and a polynucleotide coding for a polypeptide. 

In a specific embodiment of the recombinant vector according to the present 
invention, the polynucleotide of SEQ ID N°2 or one of its biologically active 
derivatives, or a biologically active derivative of SEQ ID N°3 lacking the ribosome 
binding site sequence wil have to be located in the suitable frame with an heterologous 
Shine-Dalgarno type sequence in order to allow the expression of the polypeptide 
encoding gene placed under its control. 

The preferred expression vectors carrying the polynucleotide of SEQ ID N°2 or 
SEQ ID N°3 or one of their biologically active polynucleotide derivatives are the 
conventional vectors used for polypeptide expression in bacteria, such as for example 
plasmids of the pUC family or plasmids of the pAL family. 

A specific recombinant vector according to the present invention is the plasmid 
pIPX30 which has been deposited at the CNCM on February 13, 1997 under the 
accession number 1-1845. A map of plasmid pIPX30 is represented on Figure 8. 

The polypeptide encoded by a polynucleotide contained in a recombinant vector 
according to the present invention may be any kind of polypeptide either of eukaryotic 
or prokaryotic origin. 

Preferably said polynucleotide codes for an antigenic protein of a 
mycobacterium, and preferably a mycobacterium belonging to the Mycobacterium 
tuberculosis complex 

In a most preferred embodiment, the encoded antigenic polypeptide or protein is 
a polypeptide which undergoes post translational modifications in the mycobacterium, 
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the expression of which is described by Hemnann et al. (1996), Harris et al., 1994 and 
by Garbe et al. (1993) and possibly LHP or ESAT-6. 

Other antigenic mycobacterial polypeptides of interest that may be expressed 
under the control of a regulatory polynucleotide according to the present invention are 
the followings : 

DnaK, GroEL, GroES, the 45/47 kD polypeptide from Mycobacterium tuberculosis 
(Bengard et al., 1994). 

The present invention concerns also the polynucleotide insert of a recombinant 
vector as defined hereinbefore. 

The invention also concerns a recombinant cell host containing a purified 
polynucleotide insert as defined hereinbefore or a recombinant vector according to the 
invention. 

The recombinant cell host may be a bacteria, such as for example E. coli. 

A recombinant cell host according to the present invention consists in a fast 
growing or a slow growing mycobacterium. Preferably, it consists in a mycobacterium 
belonging to the Mycobacterium tuberculosis complex, more specifically the species 
Mycobacterium tuberculosis itself or Mycobacterium bovis-BCG or mutants of these 
strains. Another embodiment of a mycobacterium recombinant cell host according to 
the present invention consists in Mycobacterium smegmatis. 

Another object of the present invention consists in a purified polypeptide 
produced by a recombinant cell host according to the invention. 

A method for preparing such a recombinant polypeptide comprises typically the 
steps of : 

a) optionally preparing a recombinant vector as described above; 

b) optionally introducing said recombinant vector in a suitable eukaryotic or 
prokaryotic cell host; 

c) cultivating the recombinant cell host of step b); 

d) purifying the recombinant polypeptide produced in the culture supernatant medium 
or in the recombinant cell host cell lysate. 

In another aspect of the present invention, polynucleotides of SEQ ID N°2, SEQ 
ID N°3 or SEQ ID N°4 are useful as starting material in order to design new 
polynucleotides that hybridize specifically under stringent hybridization conditions 
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Consequently, is also part of the present invention a polynucleotide or 
oligonucleotide comprising at least 12 consecutive nucleotides of a polynucleotide 
choosen among the group consisting of SEQ ID N°2, SEQ ID N°3 or SEQ ID N°4 

By a polynucleotide or oligonucleotide hybridizing under stringent 
hybridization conditions according to the present invention is meant a polynucleotide 
that hybridizes with a polynucleotide of SEQ ID N°2, SEQ ID N°3 or SEQ ID N°4 
under the following hybridization conditions : 

The hybridization step is realized at 65°C in the presence of 6 x SSC buffer 5 x 
Denhardt's solution, 0,5% SDS and lOOug/ml of salmon sperm DNA. 
The hybridization step is followed by four washing steps : 

- two washings during 5 min, preferably at 65°C in a 2 x SSC and 0. 1%SDS buffer 

- one washing during 30 min, preferably at 65°C in a 2 x SSC and 0. 1% SDS buffer 

- one washing during 10 min, preferably at 65°C in a 0. 1 x SSC and 0. 1%SDS buffer 

Thus, the polynucleotides of SEQ ID N°2, SEQ ID N°3 and SEQ ID N°4 or the 
nuclei fragments obtained from such polynucleotides may be used to select nucleotide 
primers notably for an amplification reaction such as the amplification reactions 
further described. 

PCR is described in the US patent N° 4,683,202. The amplified fragments may 
be identified by an agarose or a polyacrylamide gel electrophoresis, or by a capillary 
electrophoresis or alternatively by a chromatography technique (gel filtration 
hydrophobic chromatography or ion exchange chromatography). The specificity of the 
amplification may be ensured by a molecular hybridization using as nucleic probes the 
polynucleotides SEQ ID N°2, SEQ ID N°3 and SEQ ID N°4, fragments thereof 
oligonucleotides that are complementary to these polynucleotides or fragment thereof 
or their amplification products themselves. 

Amplified nucleotide fragments are used as probes that are useful in 
hybridization reactions in order to detect the presence of one polynucleotide according 
to the present invention or in order to detect mutations in the SEQ ID N°2 SEO ID 
N°3 and SEQ ID N°4. ^ 

Are also part of the present invention the amplified nucleic fragments 
( amphcons ") defined herein above. 

These probes and amplicons may be radioactively or non-radioactively labeled 
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Preferred nucleic acid fragments that can serve as primers according to the 
present invention are the followings : 

SEQ ID N° 14 : 5 '-CTGC AGC AGGTGACGTCGTTG- 3' (from nucleotide in position 
1 to the nucleotide in position 21 of SEQ ID N°l) 

SEQ ID N°15 : 5 '-CCGGGTGGCCGGGAAGTCTGTGT-3 ' (complementary of the 
sequence from nucleotide in position 468 to the nucleotide in position 446 of SEO ID 
N°l) 

SEQ ID N°16 : 5'-ACTACTTTCTCTTTCTACCTTCC-3' (complementary of the 
sequence from nucleotide in position 519 to the nucleotide in position 497 of Seq ID 

N°l). 

The above-described primers are used in combination for performing a nucleic 
acid amplification of one polynucleotide according to the present invention. Suitable 
pairs of primers used are the followings : 

a) SEQ ID N° 14 and SEQ ID N°15; 

b) SEQ ID N°14 and SEQ ID N°16. 

It is no need to say that anyone of the above-described primers may be also used 
as specific probes according to the invention. 

The primers may also be used as oligonucleotide probes to specifically detect a 
polynucleotide according to the invention. 

Other techniques related to nucleic acid amplification may also be used and are 
generally preferred to the PCR technique. 

The Strand Displacement Amplification (SDA) technique (Walker et al., 1992) 
is an isothermal amplification technique based on the ability of a restriction enzyme to 
cleave one of the strands at his recogntion site (which is under a hemiphosphorothioate 
form) and on the property of a DNA polymerase to initiate the synthesis of a new 
strand from the 3 'OH end generated by the restriction enzyme and on the property of 
this DNA polymerase to displace the previously synthesized strand being localized 
downstream. The SDA method comprises two main steps : 

a) The synthesis, in the presence of dCTP-alpha-S, of DNA molecules diat are flanked 
by the restriction sites that may be cleaved by an appropriate enzyme. 

b) The exponential amplification of these DNA molecules modified as such, by 
enzyme cleavage, strand displacement and copying of the displaced strands. The steps 
of cleavage , strand displacement and copy are repeated a sufficient number of times in 
order to obtain an accurate spTmiih^ti/ ~^--t. 
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stearothermophilus (BSOBl) and a fragment of a DNA polymerase which is devoid of 
any 5'03'exonuclease activity isolated from Bacilllus cladotenax (exo- Bed) [=exo- 
minus-5ca]. Both enzymes are able to operate at 60°C and the system is now 
optimized in order to allow the use of dUTP and the decontamination by UDG. When 
unsing this technique, as described by Spargo et al. in 1996, the doubling time of the 
target DNA is of 26 seconds and the ampllification rate is of 10 10 after an incubation 
time of ISminat 60°C. 

The SDA amplification technique is more easy to perform than PCR (a single 
thermostated waterbath device is necessary) and is faster thant the other amplification 
methods. 

Thus, another object of the present invention consists in using the nucleic acid 
fragments according to the invention (primers) in a method of DNA or RNA 
amplification according to the SDA technique. For performing of SDA, two pairs of 
primers are used : apair of external primers (Bl, B2) consisting in a sequence specific 
of the target polynucleotide of interest and a pair of internal primers (SI, S2) 
consisting in a fusion oligonucleotide carrying a site that is recognized by a restriction 
endonuclease, for exemple the enzyme BSOBl. 

As an illustrative embodiment of the use of the primers according to the 
invention in a SDA amplification reaction, a sequence that is non specific for the target 
polynucleotide and carrying a restriction site for Hindi or BSOBl is added at the 5'end 
of a primer specific either for SEQ ID N°2, SEQ ID N°3 and SEQ ID N°4. Such an 
additional sequence containing a restriction site that is . recognized by BsoBl is 
advantageously the following sequence GCATCGAATGCATGTCTCGGGT, the 
nucleotides represented in bold characters corresponding to the recognition site of the 
enzyme BsoBl. Thus, primers useful for performing SDA amplification may be 
designed from any of the primers according to the invention as described above and 
are part of the present invention. The operating conditions to perform SDA with such 
primers are described in Spargo et al, 1996. 

More specifically, the following conditions are used when preforming the SDA 
amplification reaction with the primers of the invention designed to contain a BsoBl 
restriction site : 

5soBI/exo*Bca [=exo-minus- J Sca] SDA reactions are performed in a 50nl volume with 
final concentrations of 9.5 mM MgCl 2 , 1.4 mM each dGTP, dATP, TTP, dCTP-alpha- 
S, 100 ug/ml acetylated bovine serum albumin. 10 ne/ml human placental DNA 35 
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3mM Tris-HCl, llmM NaCl, 0.3 mM DTT, 4 mM KC1, 4% glycerol, 0.008mM 
EDTA, and varying amounts of target DNA. Prior to the addition of BsoBl and exo" 
Bca, icomplete reactions (35nl) are heated at 95°C for 3 min to denature the target 
DNA, followed by 3 min at 60°C to anneal the primers. Following the addition of a 15 

5 (il enzyme mix consisting of 4 ^1 of BsoBl (40 Units/^1), 0.36 \il exoT3ca (22 
Units/nl), and 10.6 p.1 enzyme dilution buffer (10 mM Tris Hcl, 10 mM MgCl 2 , 50 mM 
NaCl, 1 mM DTT), the reactions are incubated at 60°C for 15 min. Amplfication is 
terminated by heating for 5 min in a boiling water bath. A no-SDA sample is created 
by heating a sample in a boiling water bath immediately after enzyme addition. 

10 Aerosol resistant tips from Continental Laboratory Products are used to reduce 
contamination of SDA reactions with previously amplified products. 

The polynucleotides of SEQ ID N°2, SEQ ID N°3 and SEQ ID N°4 and then- 
above described fragments, especially the primers according to the invention, are 
useful as technical means for performing different target nucleic acid amplification 

W methods such as : 

- TAS (Transcription-based Amplification System), described by Kwoh et al. in 1989; 
5 : - SR (Self-Sustained Sequence Replication), described by Guatelli et al.in 1990. 

^ - NASBA (Nucleic acid Sequence Based Amplification), described by Kievitis et al. in 
I 1991. 

20 - TMA (Transcription Mediated Amplification). 

The polynucleotides of SEQ ID N°2, SEQ ID N°3 and SEQ ID N°4 and then- 
above described fragments, especially the primers according to the invention, are also 
useful as technical means for performing methods for amplification or modification of 
25 a nucleic acid used as a probe , such as : 

- LCR (Ligase Chain Reaction), described by Landegren et al. in 1988 and improved 
by Barany et al. in 1991 who employ a thermostable ligase. 

- RCR (Repair Chain Reaction) described by Segev et al. in 1992. 

- CPR ( Cycling Probe Reaction), described by Duck et al. in 1990. 

30 - Q-beta replicase reaction, described by Miele et al. in 1983 and improved by Chu et 
al. in 1986, Lizardi et al. in 1988 and by Burg et al. and Stone et al. in 1996. 

When the target polynucleotide to be detected is a RNA, for example a mRNA, 
a reverse transcriptase enzyme will be used before the amplification reaction in order 
to obtain a cDNA from the RNA contained in the biological sample. The generated 
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used in an amplification process or a detection process according to the present 
invention. 

Thus, another object of the present invention consists in a method for detecting 
Mycobacterium tuberculosis in a biological sample comprising the steps of : 

a) Bringing into contact the nucleic acid molecules contained in the biological sample 
with a pair of purified polynucleotides primers derived from a polynucleotide of SEQ 
ID N°2, SEQ ID N°3 or SEQ ID N°4; 

b) Amplifying said nucleic acid molecules; 

d) detecting the nucleic acid fragments that have been amplified, for example by gel 
electrophoresis or with a labeled polynucleotide hybridizing specifically with a 
polynucleotide of SEQ ID N°2, SEQ ID N°3 or SEQ ID N°4. 

The invention concerns also the above method, wherein before step a), the 
nucleic acid molecules of the biological sample have been made available to a 
hybridization reaction. 

The invention is also related to a kit for detecting a Mycobacterium tuberculosis 
bacterium in a biological sample comprising : 

a) A pair of purified oligonucleotides primers according to the invention 

b) Reagents necessary to perform a nucleic acid amplification reaction; 

c) Optionally, a purified polynucleotide according to anyone of claims useful as a 
probe. 

The non-labeled polynucleotides or oligonucleotides of the invention may be 
directly used as probes. Nevertheless, the polynucleotides or oligonucleotides are 
generally labeled with a radioactive element ( 32 P, 35 S, 3 H, 125 I) or by a non-isotopic 
molecule (for example, biotin, acetylaminofluorene, digoxigenin, 5- 
bromodesoxyuridin, fluorescein) in order to generate probes that are useful for 
numerous applications. 

Examples of non-radioactive labeling of nucleic acid fragments are described in 
the french patent N° FR-78 10975 or by Urdea et al. or Sanchez-Pescador et al., 1988. 

In the latter case, other labeling techniques may be also used such those 
described in the french patents FR-2,422,956 and 2,518,755. The hybridization step 
may be performed in different ways (Matthews et al., 1988). The more general method 
consists in immobilizing the nucleic acid that has been extracted from the biological 
sample on a substrate (nitrocellulose, nylon, polystyren) and then to incubate, in 
defined conditions, the target nucleic acid with the probe. Subsequently to the 
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molecules formed are detected by an appropriate method (radioactivity, fluorescence 
or enzyme activity measurement). 

Advantageously, the probes according to the present invention may have 
structural characteristics such that they allow the signal amplification, such structural 
characteristics being, for example, branched DNA probes as those described by Urdea 
et al. in 1991 or in the European patent N° EP-0225,807 (Chiron). 

In another advantageous embodiment of the probes according to the present 
invention, the latters may be used as " capture probes and are for this purpose 
immobilized on a substrate in order to capture the targer nucleic acid contained in a 
biological sample. The captured target nucleic acid is subsequently detected with a 
second probe which recognizes a sequence of the target nucleic acid which is different 
from the sequence recognized by the capture probe. 

The oligonucleotide fragments useful as probes or primers according to the 
present invention may be prepared by cleavage of the polynucleotides of SEQ ID N°2, 
SEQ ID N°3 and SEQ ID N°4 by restriction enzymes, the one skill in the art being 
guided by the restriction maps presented in the annexes I and II of the instant 
specification. The experimental procedure conditions suitable for using the restriction 
enzymes are described in Sambrook et al. (1989). 

Another appropriate preparation process of the nucleic acids of the invention 
containing at most 200 nucleotides (or 200 bp if these molecules are double stranded) 
comprises the following steps : 

- synthesizing DNA using the automated method of beta-cyanethylphosphoramidite 
described in 1986; 

- cloning the thus obtained nucleic acids in an appropriate vector; 

- purifying the nucleic acid by hybridizing an appropriate probe according to the 
present invention. 

A chemical method for producing the nucleic acids according to the invention 
which have a length of more thant 200 nucleotides nucleotides (or 200 bp if these 
molecules are double stranded) comprises the fllowing steps : 

- assembling the chemically synthsised oligonucleotides, having different restriction 
sites at each end. 

- cloning the thus obtained nucleic acids in an appropriate vector. 

- purifying the nucleic acid by hybridizing an appropriate probe according to the 
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ensure that their sequences are compatible ( in the appropriate reading frame) with the 
aminoacid sequence of the polypeptide to be produced. 

The oligonucleotide probes according to the present invention may also be used 
in a detection device comprising a matrix library of probes immobilized on a substrate, 
the sequence of each probe of a given length being localized in a shift of one or several 
bases, one from the other, each probe of the matrix library thus being complementary 
of a distinct sequence of the target nucleic acid. Optionally, the substrate of the matrix 
may be a material able to act as an electron donnor, the detection of the matrix 
positions in which an hybridization has occurred beeing subsequently determined by 
an electronic device. Such matrix libraries of probes and methods of specific detection 
of a targer nucleic acid is described in the European patent application N° EP- 
0713,016 (Affymax technologies) and also in the US patent N° US-5,202,231 
(Drmanac). 

Thus, another object of the present invention consists in a method for detecting 
the presence of Mycobacterium tuberculosis bacteria in a biological sample 
comprising the steps of : 

a) bringing into contact a purified polynucleotide derived from SEQ ID N°2, SEQ ID 
N°3 or SEQ ID N°4 with a nucleic acid contained in the biological sample. 

b) detecting the hybrid nucleic acid molecule formed between said purified 
polynucleotide and the nucleic acid molecules contained within the biological sample. 

In a particular embodiment of the above method, the nucleic acid molecules of 
the biological sample have been made available to a hybridization reaction before 
performing step a). 

The invention also concerns a method for detecting a Mycobacterium 
tuberculosis bacterium in a biological sample comprising the steps of : 

a) bringing into contact a purified polynucleotide probe according to the invention that 
has been immobilized onto a substrate with a biological sample. 

b) bringing into contact the hybrid nucleic acid molecule formed between said purified 
polynucleotide and the nucleic acid contained in the biological sample with a labeled 
polynucleotide probe according to the invention, provided that the probe of step a) and 
the probe of step b) have non-overlapping nucleotide sequences. 

The invention pertains also to the above method wherein, before step a), the 
nucleic acid molecules of the biological sample have been made available to a 
hybridization reaction. 
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The invention is also directed to the above method wherein, before step b), the 
nucleic acid molecules that are not hybridized with the immobilized purified 
polynucleotide are removed. 

Another object of the present invention consists in a kit for detecting a 
Mycobacterium tuberculosis bacterium genus in a biological sample comprising : 

a) A purified polynucleotide probe according to the invention 

b) Reagents necessary to perform a nucleic acid hybridization reaction. 

The invention also pertains to a kit for detecting a Mycobacterium tuberculosis 
bacterium in a biological sample comprising : 

a) A purified polynucleotide probe according to the invention that is immobilized onto 
a substrate. 

b) Reagents necessary to perform a nucleic acid hybridization reaction. 

c) A purified polynucleotide probe according to the invention which is radioactively or 
non-radioactively labeled, provided that the probe of step a) and the probe of step b) 
have non-overlapping nucleotide sequences. 

As already specified, the present inventors have characterized a new 
polypeptide, named LHP, that is encoded by the polynucleotide sequence of SEQ ID 
N°l, and more precisely by the polynucleotide of sequence SEQ ID N°4. The 
polynucleotide of SEQ ID N°4 encodes the LHP polypeptide of SEQ ID N°5 which is 
described hereunder. 

Thus, another object of the present invention consists in a purified polypeptide, 
named LHP, and having the following amino acid sequence SEQ ID N°5 : 

MAEMKTDAATLGQEAGNFERISGDLKTQIDQVESTAGSLQGQWRGAAGTAAQ 
AAVVRFQEAANKQKQELDEISTNIRQAGVQYSRADEEQQQALSSQMGF 

The correspondance between the one letter-code and the three letters-code for 
amino acids is found in the book of Stryer Biochemistry, Third Ed. (1988), which is 
incorporated herein by reference for all purposes. 

In both immunodiagnostics and vaccine preparation, it is often possible and 
practical to prepare antigens from segments of a known immunogenic protein or 
polypeptide. Certain epitopic regions may be used to produce responses similar to 
those produced by the entire antigenic polypeptide. Potential antigenic or 
immunogenic regions may be identified by any of a number of approaches, e.g., 
Jameson-Wolf or Kyte-Doolittle antigenicity analyses or Hopp and Woods (1981) 
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calculated and regions of greatest hydrophilicity determined. Using one or more of 
these methods, regions of predicted antigenicity are derived from the amino acid 
sequence assigned to the polypeptides according to the present invention. 

The present invention is also directed to portions of the polypeptide of amino 
5 acid sequence of SEQ ID N°5 that are highly immunogenic and which may thus serve 
as components of an immunogenic composition or a vaccine composition for the 
purpose of diagnosing or preventing an Mycobacterium tuberculosis infection in a 
patient. 

In order to identify the relevant antigenic or immunogenic portions of the 
10 polypeptide of SEQ ID N°5, the one skill in the art may bring a specific peptide 

derived from the polypeptide of SEQ ID N°5 in the presence of a serum sample of a 

patient infected with Mycobacterium tuberculosis and then detect the complex 

eventually formed between the antibodies contained in the serum sample and the 
Si 1 peptide being assayed. Such a screening assay used to define the relevant 
IS immunogenic portions of the polypeptide of SEQ ID N°5 is advantageously a 
k \ conventional ELIS A type assay, wherein, as an illsutrative embodiment, radioactively 

or fluorescently anti-Ig antibodies are used for detecting the antigen-antibody 
I ' complexes formed. 

Antigenic portions of the LHP polypeptide may obtained by enzymatic cleavage 
2p; of the parent purified polypeptide, the one skill in the art being guided by the digestion 

map of the polypeptide of SEQ ID N°5 represented in Annex IV. 

Preferred antigenic portion of the polypeptide accroding to the present invention 

are comprising the hydrophiiic parts of the LHP polypeptide, as determined notably in 

Figures 13 and 14. 

25 Thus, the preferred antigenic portions of a polypeptide according to the 

invention comprise peptides or pseudopeptides derived from the following peptides 
consisting in : 

a) amino acid in position 1 to amino acid in position 48 of SEQ ID N°5; 

b) amino acid in position 60 to amino acid in position 100 of SEQ ID N°5; 

30 which represent the most hydrophylic regions of the LHP polypeptide of the invetion. 

Specific immunogenic portions of the polypeptide of SEQ ID N°5 characterized 
by the inventors are the followings : 



a) SEQ ID N° 6 : NH2-MAEMKTDAATLGQEAGNFERISGDLKTQIDQVESTAGS 
35 LQGQ WRG AAGT-COOH ; 
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ALSSQMGF-COOH; 

c) SEQ ID N°8 : NH2-QEAGNFERISGDLKTQIDQV-COOH; 

d) SEQ ID N°9 : NH2-GDLKTQIDQVESTAGS-COOH; 

e) SEQ ID N° 10 : NH2-GSLQGQWRGAAGTAAA-COOH; 

f) SEQ ID N°ll : NH2-QEAANKQKQELDEIST-COOH; 

g) SEQ ID N°12 : NH2-STNIRQAGVQYSRADEEQQQ ALSSQMGF-COOH; 

h) SEQ ID N° 13 : NH2-RADEEQQQALSSQMGF-COOH. 

In a preferred embodiment of the immunogenic polypeptide according to the 
present invention, the epitope unit of said polypeptide have from 6 to 50 aminoacids in 
length, preferably from 6 to 20 aminoacids in length and most preferably from 6 to 15 
aminoacids in length, and is capable to induce in vivo a protective immune response 
against the LHP antigen which is expressed by Mycobacterium tuberculosis. An 
immunogenic polypeptide having a long amino acid chain (from 25 to 50 amino acids 
in length) is preferably used in case of conformational epitope units. Furthermore, a 
large epitope unit is expected to carry both a B-epitope and a T-epitope. 

By an epitope or an epitope unit according to the present invention is meant a 
portion of the LHP polypeptide which is delinated by the area of interaction with 
antibodies that are specific to LHP, in particular monoclonal antibodies directed 
against LHP. The above disclosed immunogenic portions of the LHP polypeptide of 
SEQ ID N°5 are all bearing at least one epitope unit. 

Are also part of the immunogenic polypeptides of the present invention those 
polypeptides which comprise, but are not limited to, at least one epitope unit 
recognized by a monoclonal antibody directed against the LHP polypeptide or a 
peptide fragment thereof. 

Specifically, the monoclonal or polyclonal antibody according to the invention 
recognizes the LHP polypeptide of SEQ ID N°5 or one peptide fragment thereof. 

The antibodies may be prepared from hybridomas according to the technique 
described by Phalipon et al. in 1995 or also by Kohler and Milstein in 1975. The 
polyclonal antibodies may be prepared by immunisation of a mammal, especially a 
mouse or a rabbit, with a polypeptide according to the invention that is combined with 
an adjuvant of immunity, and then by purifying of the specific antibodies contained in 
the serum of the immunized animal on an affinity chromatography column on which 
has previously been immobilized the polypeptide that has been used as the antigen. 
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The present invention is also directed to a diagnostic method for detecting the 
presence of a Mycobacterium tuberculosis in a biological sample, said diagnostic 
method comprising the steps of : 

a) bringing into contact the biological sample expected to contain a Mycobacterium 
tuberculosis bacterium with a purified monoclonal or polyclonal antibody according 
to the invention; 

b) detecting the antigen-antibody complexes formed; 

In a specific embodiment of the above diagnostic method, step a) is preceded by 
preparing a purified preparation of the said anti-immunogenic polypeptide monoclonal 
or polyclonal antibody. 

In a preferred embodiment of the above diagnostic method, said method consists 
in an immunoassay including enzyme linked immunoassay (ELISA), immunoblot 
techniques, as well as radio-immunoassays (RIA) which preceding techniques are all 
available from the prior art. 

A typical preferred immunoassay according to the invention comprises the 
following steps : 

a) incubating microtitration plate wells with increasing dilutions of the biological 
sample to be assayed; 

b) introducing in said microtitration plate wells with a given concentration of a 
monoclonal or polyclonal antibody according to the invention; 

c) adding a labeled-antibody directed against human or animal immunoglobulins, 
the labeling of said antibodies being, for example, an enzyme that is able to 
hydrolyse a substrate molecule, the substrate molecule hydrolysis inducing a change 
m the light absorbtion properties of said substrate molecule at a given wavelength, 
for example at 550 nm. 

The present invention also concerns a diagnostic kit for the in vitro diagnosis of 
an infection by Mycobacterium tuberculosis, comprising the following elements : 

a) A purified preparation of a monoclonal or a polyclonal antibody according to the 
invention; 

b) Suitable reagents allowing the detection of the antigen/antibody complexes 
formed , these reagents preferably carrying a label compound ( a marker), or being 
recognized themselves by a labeled reagent. 

c) optionally, a reference biological sample containing the pathogenic 
microorganism antigen recognized by the purified monoclonal or polyclonal 
antibody (positive control); 



27 



d) optionally, a reference biological sample that does not contains the pathogenic 
microorganism antigen recognized by the purified monoclonal or polyclonal 
antibody (negative control). 

The present invention is also directed to a polyclonal or a monoclonal antibody 
directed against an immunogenic peptide according to the invention. 

Are also part of the present invention polypeptides that are homologous to the 
initially selected polypeptide bearing at least an epitope unit. By homologous peptide 
according to the present invention is meant a polypeptide containing one or several 
aminoacid substitutions , in the aminoacid sequence of the initially selected polypeptide 
carrying an epitope unit. In the case of an aminoacid substitution, one or several - 
consecutive or non-consecutive- aminoacids are replaced by " equivalent " aminoacids. 
The expression " equivalent " aminoacid is used herein to name any aminoacid that 
may subsituted for to one of the aminoacids belonging to the initial polypeptide 
structure without decreasing the binding properties of the corresponding peptides to 
the monoclonal antibody that has been used to select the parent peptide and without 
decreasing the immunogenic properties against the specified pathogenic 
microorganism. Thus, an homologous polypeptide according to the present invention 
has the same immunological characteristics as the parent polypeptide (for example as 
the polypeptide of SEQ ID N°5) with respect to the ability to confer increaes resistance 
to infections with bacteria belonging to the tuberculosis complex. 

These equivalent aminoacyles may be determined either by their structural 
homology with the initial arninoacyles to be replaced, by the similarity of their net 
charge, and by the results of the cross-immunogenicity between the parent peptides 
and their modified counterparts. 

The peptides containing one or several " equivalent " aminoacids must retain 
their specificity and affinity properties to the biological targets of the parent protein, as 
it can be assessed by a ligand binding assay or an ELISA assay. 

For example, amino acids may be placed in the following classes : non-polar, 
uncharged polar, basic, and acidic. Conservative substitutions whereby an aminoacid 
of one class is replaced with another amino acid of the same type fall within the scope 
of the subject invention so long as the substitution does not materially alter the 
biological activity of the compound. Table 1 provides a listing of examples of 
aminoacids belonging to each class 
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Class of Amino acid 



Examples of amino acids 



to 



5 



Non polar 
Uncharged polar 
Acidic 
Basic 



A, V, L, I, P, G, F, W 
K S, T, C, Y, N, Q 
D,E 



By modified aminoacid according to the present invention is also meant the 
replacement of a residue in the L-form by a residue in the D form or the replacement 
of a Glutamic acid (E) residue by a Pyro-glutamic acid compound. The synthesis of 
peptides containing at least one residue in the D-form is, for example, described by 
Koch etal. in 1977. 

As an illustrative example, it should be mentioned the possibility to realize 
substitutions without a deep change in the immunogenic polypeptide binding 
properties of the correspondant modified peptides by replacing, for example, leucine 
by valine, it being understood that the reverse substitutions are permitted in the same 
conditions. 

In order to design peptides homologous to the immunogenic polypeptides 
according to the present invention, the one skill in the art can also refer to the 
teachings of Bowie et al. (1990). 

A specific, but not limitative, embodiment of a modified peptide molecule of 
interest according to the present invention, which consists in a peptide molecule 
named herein also " pseudopeptide ", which is resistant to proteolysis, is a peptide iri 
which the -CONH- peptide bound is modified and replaced by a (CH 2 NH) reduced 
bound, a (NHCO) retro inverse bound, a (CH 2 -0) methylene-oxy bound, a (CH 2 -S) 
thiomethylene bound, a (CH 2 CH 2 ) carba bound, a (CO-CH 2 ) cetomethylene bound, a 
(CHOH-CH 2 ) hydroxyethylene bound), a (N-N) bound, a E-alcene bound or also a - 
CH=CH- bound. 

The immunogenic polypeptides according to the present invention may be 
prepared in a conventional manner by peptide synthesis in liquid or solid phase by 
successive couplings of the different aminoacid residues to be incorporated (from the 
N-terrninal end to the C-terminal end in liquid phase, or from the C-terminal end to the 
N-terminal end in solid Dhase> wherein the M-t^rm-rio! >u„ .... 
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For solid phase synthesis the technique described by Merrifield may be used in 
particular. Alternatively, the technique described by Houbenweyl in 1974 may also be 
used or generally any chemical synthesis method well known by the one skill in the 
art, such as for example a chemical synthesis method performed with a device 
5 apparatus commercialized by the Applied Biosystems firm. 

In order to produce a peptide chain using the Merrifield process, a highly 
porous resin polymer is used, on which the first C-terminal aminoacid of the chain is 
fixed. This aminoacid is fixed to the resin by means of its carboxyl groups and its 
amine function is protected, for example, by the t-butyloxycarbonyl group. 
10 A peptide or pseudopeptide according to the present invention is 

advantageously combined with or contained in an heterologous structure, or 
polymerized in such a manner as to enhance their ability to induce a protective 
immune response against the pathogenic microorganism. 
: - As a particular embodiment of the immunogenic polypeptide according to the 

15; present invention, said immunogenic polypeptide comprise more than one epitope unit, 
'A I preferably from 2 to 20 epitope units, more preferably from 2 to 15 epitope units and 
most preferably 3 to 8 epitope units per polypeptide molecule, usable as an active 
>i~ principle of a vaccine composition. 

The immunogenic polypeptides of the invention that comprise more than one 
20. epitope unit are herein termed " oligomeric polypeptides The said polymers may be 
obtained by the technique of Merrifield or any other conventional peptide polymer 
synthesis method well known by the one skill in the art. 

The peptides thus obtained may be purified, for example by high perfomance 
liquid chromatography, such as reverse phase and/or cationic exchange HPLC, as 
25 described by Rougeot et al. in 1994. 

As another particular embodiment of the oligomeric immunogenic polypeptides 
according to the present invention, the peptides or pseudopeptides are embedded 
within a peptidic synthetic matrix in order to form a MAP (Multi-branched Associated 
Peptide) type structure. Such MAP structures as well as their method of preparation 
30 are described by Tarn in 1988 or in the PCT patent application N° W094/28915 
(Hovanessian et al.). The embedding of the peptides or pseudopeptides of therapeutic 
value according to the present invention within MAP type structures are expected to 
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polypeptides are presented via a MAP (Multiple Antigen Peptide) construct This kind 
of presentation system is able to present more than one copy of a selected epitope unit 
per molecule (4 to 8 immunogenic polypeptide mimic per MAP construct molecule) 
embedded m a non immunogenic " carrier " molecule. 
5 Thus, another object of the present invention consists in peptide constructs that 

are able to ensure an optimal presentation of the LHP immunogenic portions of the 
invention to the immune system. 

In a specific embodiment of the peptide constructs according to the invention, 
the immunogenic polypeptides (the epitope units) are part of a MAP construct as 
10 defined above, such MAP construct comprising from four to eight epitope units per 
molecule, for examle grafted on a lysine core. 

Generally, an immunogenic polypeptide according to the present invention will 
m comprise an additional T-epitope that is covalently or non-covalently combined with 
said polypeptide of the invention. In a preferred embodiment, the additional T-epitope 
is covalently linked to the immunogenic polypeptide. 

Illustrative embodiments of a suitable T-cell epitope to be combined with an 
immunogenic peptide mimick according to the invention are, for example, the 
followings : 

- hepatitis delta T-cell epitopes (Nisini et al., 1997); 

- a T-cell epitope from the Influenza virus (Fitzmaurice et al., 1996); 

- a T-cell epitope of woodchuck hepatitis virus (Menne et al., 1997); 

- a T-cell epitope from the rotavirus VP6 protein (Banos et al., 1997); 

- a T-cell epitope from the structural proteins of entroviruses, specifically from the 
VP2, VP3 and VPl capsid proteins (Cello et al., 1996); 

- a T-cell epitope from Streptococcus mutans (Senpuku et al., 1996); or also 

- a T-cell epitope from the VPl capsid peotein of the foot and mouth disease virus 
(Zamorano et al., 1995); 

Preferred additional T-epitopes used according to the present invention are for 
example universal T-epitopes, such as tetanus toxoid or also the VPl poliovirus capsid 
protein (Graham et al., 1993). 

In a most preferred embodiment, the T-cell epitope used consists in a peptide 
comprised between amino acid in position 103 and amino acid in position 115 of the 
VPl poliovirus capsid protein. 

Thus, the MAP construct may comprise an additional T epitope which is 
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construct. Accordingly, the additional T-epitope maybe located at the external end 
(opposite to the core) of the MAP or conversely, the additional T-epitope may be 
directly linked to the core of the MAP construct, thus preceding the immunogenic 
polypeptide which is then external to the MAP construct. 

In another embodiment of the peptide constructs according to the present 
invention, the immunogenic polypeptide is directly coupled with a carrier molecule 
such as KLH (Keyhole Limpet Hemocyanin) or preferably with tetanus toxoid. 

The immunogenic polypeptides according to the invention may be presented in 
different additional ways to the immune system. 

In one specific embodiment the immunogenic polypeptide of the invention may 
be presented under the form of ISCOMs (Immunostimulating complexes) that are 
composed of Quil A ( a saponin extract from Quilaja saponaria olina bark), 
cholesterol and phospholpids associated with the immunogenic polypeptide (Mowat et 
aL, 1991; Morein, T 990, Kerstenetal., 1995). 

The immunogenic polypeptides of the invention may also be presented under 
the form of biodegradable microparticles (microcapsules or microspheres) such as for 
example lactic and glutamic acid polymers as described by Aguado et al. in 1992, also 
termed Poly(lactide-co-glycolide) microcapsules or microspheres. 

Other microparticles used to present the LHP-derived polypeptide antigens of 
the invention are synthetic polymer microparticles carrying on their surface one or 
more immunogenic polypeptides covalently bonded to the material of the 
microparticles, said immunogenic polypeptide(s) each carrying one or more epitope 
units and being present at a density of between 10 4 and 5.10 5 molecules /fim 2 . These 
microparticles have an average diameter of between about 0.25 |im and 1.5 \im y and 
preferentially of about 1 \im so as to be able to be presented to CD4+ T lymphocytes 
by phagocytic cells. Said microparticles are more particularly characterized in that the 
covalent bond is formed by reaction between the NH2 and/or CO groups of the 
immunogenic peptide mimic and the material making up the microparticle. 
Advantageously, such bond is created by bridging reagent as intermediate, such as for 
example glutaraldehyde or carbodiimide. The material of the microparticle can 
advantageously be a biocompatible polymer, such as acrylic polymer, for example 
polyacrolein or polystyrene or the poly(alpha-hydroxy acids), copolymers of lactic and 
glycolide acids or lactic acid polymers, said polymers being a homopolymer or hetero- 
or co-polymer. The above described microparticles characteristics are found in the 
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The immunogenic polypeptide of the invention may also be included within or 
adsorbed onto liposomes particles, such as those described in the PCT Patent 
Application N° PCT/FR 95/00215 Published on August 31, 1995 (Riveau et al.). 

The present invention is also directed to an immunogenic composition 
comprising an immunogenic polypeptide according to the invention, notably under the 
form of a MAP construct or a peptide construct as defined above, and including the 
oligomeric immunogenic polypeptides described hereinbefore, or also under a 
microparticle preparation. 

The invention also pertains to a vaccine composition for immunizing human and 
mammal animals against a Mycobacterium tuberculosis infection, comprising an 
immunogenic composition as described above in combination with a pharmaceutically 
compatible excipient (such as saline buffer), optionally in combination with at least 
one adjuvant of the immunity such as aluminium hydroxide or a compound belonging 
to the muramyl peptide family. 

A vaccine according to the present invention is preferably one which is capable 
of inducing a substantial ans specific acquired immune resistance in a mouse or guinea 
pig against tuberculosis caused by mycobacteria belonging to the tuberculosis- 
complex, which acquired immune resistance corresponds to at least 20 % of the 
protective immune resistance elicited by Mycobacterium bovis-BCG y as assessed by 
the observed reduction in mycobacterial counts from spleen, lung or other organ 
homogenates isolated from the mouse or guinea pig receiving a challenge infection 
with a virulent strain of M. tuberculosis. 

The preferred acquired immune resistance corresponds to at least 50% of the 
protective immune response elicited by M bovis-BCG, such as at least 60%, or even 
more preferred to at least 80% of the protective immune resistance elicited by M. 
bovis-BCG, such as at least 90% and advantageously 100%. 

Various methods for achieving adjuvant effect for the vaccine include the use of 
agents such as aluminium hydroxide or phosphate (alun), commonly used as 0.05 to 
0. 1 percent solution in phosphate buffered saline, admixture with synthetic polymers 
of sugars (Carbopol) used as 0.25 % solution. Another suitable adjuvant compounds 
consist in DDA (dimethyldioctadecylammonium bromide), as well as immune 
modulating substances, such as lymphokines (e.g. gamma-IFN, IL-1, IL-2 and IL-12) 
or also gamma-IFN inducers compounds, such as poly I:C. 
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4,601,903; 4,599,231; 4,599,230; 4,596,792 and 4,578,770, all incorporated herein by 
reference. 

The vaccine composition according to the present invention is advantageously 
prepared as injecteable either as liquid solution or suspension; solid forms suitable for 
solution in or suspension in, liquid prior injection may also be prepared. 

The active immunogenic polypeptide contained in the vaccinal composition is 
generally mixed with excipients which are pharmaceutically acceptable and 
compatible, such as for example, water, saline, dextrose, glycerol, ethanol, or a 
combination of more than one of the above excipients. 

In addition, if desired, the vaccine composition may contain minor amounts of 
auxiliary substances such as wetting or emulsifying agents, pH buffering agents, or 
adjuvants which enhance the effectiveness of the vaccines. 

The vaccines are conventionally administered parenterally, by injection, for 
example, either subcutaneously or intramuscularly. Additional formulations are 
suitable for other modes of administration include suppositories and, in some cases, 
oral formulations, which may be preferred embodiments for the development of a 
desired mucosal immunity. 

The immunogenic polypeptide of the invention may be formulated into the 
vaccine as neutral or salt forms. Pharmaceutically acceptable salts include acid 
addition salts (fromed with free amino groups of the peptide) and which are formed 
with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such 
organic acids as acetic oxalic, tartaric or mandelic acid. Salts formed with the free 
carboxyl groups may also be derived from inorganic bases such as, for example, 
sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic beses 
as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine or procaine. 

The vaccine compositions of the invention are administered in a maimer 
compatible with the dosage formulation, and in such amount as will be therapeutically 
effective and immunogenic. The quantity to be administered depends on the subject to 
be treated, including, e.g., the capacity of the individual's immune system to mount an 
immune response. 

Suitable dosage ranges are of the order of several hundred micrograms active 
immunogenic polypeptide with a preferred range from about 0.1 jig to 1000 ng, such 
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The dosage of the vaccine will depend on the route of administration and will 
vary according to the age of the patient to be vaccinated and, to a lesser degree, the 
size of the person to be vacinated. 

Preferably, both in the case of an immunogenic polypeptide carrying a single 
epitope- unit and in the case of an immunogenic polypeptide carrying several epitope 
units, the vaccine composition is administered to human in the range from 0. 1 to 1 ug 
immunogenic polypeptide per kilogram patient's body weight, preferably in the range 
from 0.5 ug/kg of body weight, this representing a single vaccinal dose for a given 
ad^ninistration. 

In the case of patients affected with immunological disorders, such as for 
example immunodepressed patients, each injected dose preferably contains half the 
weight quantity of the immunogenic polypeptide contained in a dose for a healthy 
patient. 

In many instances, it will be necessary to proceed with multiple administrations 
of the vaccine composition according to the present invention, usually not exceeding 
six acirninistrations, more usually not exceeding four vaccinations, and preferably one 
or more, usually at least about three administrations. The administrations will normally 
be at from two to twelve week intervals, more usually from three to five week 
intervals. Periodic boosters at intervals of 1-5 years, usually three years, will be 
desirable to maintain the desired levels of protective immunity. 

Preferably, the vaccine composition is administered several times. As an 
illustrative example, three vaccinal doses as defined herein above are respectively 
administered to the patient at time tO, at time tO + 1 month and at time tO + 12 months. 

Alternatively, three vaccinal doses are respectively administered at time tO, at 
time tO + l month and at time tO + 6 months. 

The course of the immunization may be followed by in vitro proliferation 
assays of PBL (peripheral blood lymphocytes) co-cultured with the immunogenic 
polypeptide of the invention, and especially by measuring the levels of gamma-IFN 
released from the primed lymphocytes. The assays may be performed using 
conventional labels, such as radionuclides, enzymes or fluorescent compounds. These 
techniques are well known from the one skill in the art and found notably in the US 
Patents N° 3,731,932; 4,174,384 and 3,949,064, which are herein incorporated by 
reference. 
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gamma-IFN will be released, accordingly, a vaccine composition according to the 
invention comprises a polypeptide capable of releasing from the memory T- 
lymphocytes at least 1500 pg/ml, such as 2000 pg/ml, preferably 3000 pg/ml gamma- 
IFN, in the above described in vitro assays. 

In mice, that are administered with a dose comparable to the dose used in 
human, the antiboy production is assayed after recovering the immune serum and 
revealing the immune complex formed between the antibodies present in the serum 
samples and the immunogenic polypeptide contained in the vaccine composition, using 
the usual methods well known from the one skill in the art. 

The immunogenic polypeptides used in the vaccinal strategy according to the 
present invention may also be obtained using genetic engineering methods. The one 
skill in the art will refer to the known sequence of DNA insert that expresses a specific 
antigenic portion (epitope unit) of an immunogenic polypeptide of the invention and 
also to the general litterature to determine which appropriate codons may be used to 
synthetize the desired peptide. 

There is no need to say that the expression of the polynucleotide that encodes 
the immunogenic polypeptide of interest may be optimized, according to the organism 
in which the sequence has to be expressed and the specific codon usage of this 
organism (mammal, plant, bacteria etc.). For bacteria and plant, respectively, the 
general codon usages may be found in the European Patent Application N° EP- 
0359472 (Mycogen). 

As an alternative embodiment, the epitope unit of the immunogenic polypeptide 
contained in a vaccine composition according to the present invention is recombinantly 
expressed as a part of longer polypeptide that serves as a carrier molecule. 

Specifically, the polynucleotide coding for the immunogenic polypeptide of the 
invention, for example a polypeptide having an amino acid length between 10 and 200 
amino acid residues, is inserted at at least one permissive site of the polynucleotide 
coding for the Bordetella cyaA adenylate cyclase, for example at a nucleotide position 
located between aminoacids 235 and 236 of the Bordetella adenylate cyclase. Such a 
technique is fully described in the United States Patent N° 5,503,829 granted on April 
2, 1996(LeclercetaL). 

In another embodiment of the vaccine composition according to the present 
invention, the nucleotide sequence coding for the desired immunogenic polypeptide 
carrying one or more epitope units is inserted in the nucleic sequence coding for an 
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N° PCT/US 96/17698 (The Research Foundation of State University of New York), 
which is herein incorporated by reference. 

In a further embodiment of the vaccine composition acording to the present 
invention, the latter is based upon a live recombinant cell host expressing the entire 
5 LHP polypeptide of sequence SEQ ID N°5 or alternatively a polypeptide containing an 
immunogenic portion of LHP according to the invention or also an oligomeric 
immunogenic LHP-derived polypeptide such as those described hereinbefore. 

The microorganism in the vaccine may be a bacterium such as bacteria selected 
from the group consisting of the genera Mycobacterium, Salmonella, Pseudomonas or 
10 E. coli. 

A preferred embodiment of a vaccine composition containing a live 
recombinant cell host according to the invention consists in a Mycobacterium bovis- 
BCG strain which has been transformed with a polynucleotide encoding the entire 
% LHP polypeptide or alternatively a polypeptide containing an immunogenic portion of 
15) LHP or also an oligomeric immunogenic LHP-derived polypeptide. 
;i ; An advantageous method used to transform a Mycobacterium bovis-BCG strain 

On with a polynucleotide coding for an immunogenic polypeptide according to the present 
invention consists in introducing the polynucleotide of interest via an allelic exchange 
event (homologous recombination involving a double cross-over) or via an 
2p; homologous recombination involving a single cross-over, using a recombinant vector, 
k Such a recombinant vector carries the gene encoding the immunogenic 

^ polypeptide of interest which has been introduced in a polynucleotide counterpart of a 
gene non essential for the growth of Mycobacterium bovis-BCG on the vector, such as 
for example the urease gene. Said vector carries advantageously also a conditional 
25 lethal selection marker such as SacB gene. The relevant transformation methods and 
vectors are fully described by Reyrat et al. (1995) or Pelicic et al. (1996). 

Another embodiment of the live vaccine compositions according to the present 
invention consists in compositions containing live mycobacteria, and preferably live 
Mycobacterium bovis-BCG or mutant derived from Mycobacterium tuberculosis or 
30 Mycobacterium bovis-BCG transformed with a recombinant vector containing an 
antigenic protein placed under the control of a regulatory polynucleotide according to 
the present invention. 

The live vaccine compositions of the invention are administered in a manner 
compatible with the dosage formulation, and in such amount as will be therapeutically 
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be treated, including, e.g., the capacity of the individual's immune system to induce an 
immune response. 

Suitable dosage ranges are of the order of 10 4 to 10 6 cfu (colony forming units) 
at an attenuated recombinant mycobacteria concentration of about 10 6 cfu /mg. Most 
preferably, the effective dose is about 10 5 cfu. 

The dosage of the vaccine will depend on the route of administration and will 
vary according to the age of the patient to be vaccinated and, to a lesser degree, the 
size of the person to be vacinated. Most preferably, the vaccine composition according 
to the present invention is administered via an intradermal route and in a single boost. 

In the case of patients affected with immunological disorders, such as for 
example immunodepressed patients, each injected dose preferably contains half the 
weight quantity of the attenuated mycobacteria contained in a dose for a healthy 
patient. 

In the case of neonates, the dose will be four times less than for an adult, and in 
case of young children (4-6 years old), the dose will be half the dose used for an adult 
healthy patient. 

In some instances, it will be necessary to proceed with multiple administrations 
of the vaccine composition according to the present invention, usually not exceeding 
six administrations, more usually not exceeding four vaccinations, and preferably one 
or more, usually at leastabout three administrations. The administrations will normally 
be at from two to twelve week intervals, more usually from three to five week 
intervals. Periodic boosters at intervals of 1-5 years, usually three years, will be 
desirable to maintain the desired levels of protective imm unity, 

Immunization by DNA-based vaccines has been the object of several studies 
since the beginning of the 1990s. A DNA-based vaccine involves the transfer of a gene 
or at least a portion of a gene, by direct or indirect means, such that the protein 
subsequently produced acts as an antigen and induces a humoral-and/or cellular 
mediated immunological response. 

Ulmer et al. -Science, 259: 1745-1749 [1993] obtained protection against the 
influenza virus by induction of the cytotoxic T lymphocytes through injection of a 
plasmid coding for an influenza A nucleoprotein into the quadriceps of mice. The 
plasmid used carries either the Rous sarcoma virus promoter or the cytomegalovirus 
promoter. 

Raz et al. -Proc.Natl. Acad .Sci. USA 90 :4523-4527 T19931 injected vectors 
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cell-mediated immune response of the mice to which these plasmids have been 
intramuscularly administered are improved. 

Wang et al. -Proc. Natl. Acad .Sci. USA 90 : 4156-5160 [1993] injected a 
plasmid carrying a gene coding for the envelope protein of the HIV-1 virus into mice 
muscles. The plasmid injection was preceded by treatment with bupivacaine in the 
same area of the muscle. The authors demonstrated the presence of antibodies capable 
of neutralizing the HTV-1 virus infection. However, the DNA was injected twice a 
week for a total of four injections. 

Davis et al. (Compte-Rendu du 28eme Congres Europeen sur le muscle, 
Bielefeld, Germany, 21-25 September 1992) injected plasmids carrying a luciferase or 
P-galactosidase gene by pretreating the muscles with sucrose or a cardiotoxin. The 
authors observed the expression of luciferase or P-galactosidase. 

More recently, an article published in Science et Avenir (September 1993 : 22- 
25) indicates that Whalen and Davis succeeded in immunizing mice against the 
hepatitis B virus by injecting pure DNA from the virus into their muscles. An initial 
injection of snake venom toxin, followed 5 to 10 days later by a DNA injection, is 
generally described. However, the authors specify that this method is not practical. 

These studies were preceded by others experiments in which various DNAs 
were injected, in particular into muscle tissues. For example, the US Patents No 
5,589,466 and No 5,580,859 (VICAL INC) and the International application 
PCT/US90/015 15 (published under No WO/90/11092) disclose various plasmid 
constructions which can be injected in particular into muscle tissues for the treatment 
of muscular dystrophy. However, this later document specifies that DNA is 
preferentially injected in liposomes. 

Additionally, Canadian Patent CA 362 96630 (published under No 1,169,793) 
discloses the intramuscular injection of liposomes containing DNA coding, in 
particular, for HBs or HBc antigens. The results described in this patent mention the 
HBs antigen expression. The presence of anti-HBs antibodies was not investigated. 

International Application PCT/FR92/00898 (published under No WO93/06223) 
discloses viral vectors which can be conveyed to target cells by blood. These vectors 
are recognized by the cell receptors, such as the muscle cells, and can be used in the 
treatment of muscular dystrophy or thrombosis. 
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The present invention relates to a composition capable of inducing an immune 
response, and more particularly, an humoral or/and a cytotoxic response comprising a 
nucleotide sequence expressed in muscle cells. The nucleotide sequence comprises a 
gene or complementary DNA coding for at least a portion of nucleotidic sequence 
comprised in the pIPX61 insert preferably the Ihp polynucleotide coding region and a 
promoter and/or regulatory region allowing for the expression of the gene or 
complementary DNA in the muscle cells. 

The invention further relates to the vector, which serves as a vehicle for the 
gene or complementary DNA coding for at least lhp polynucleotide coding region and 
a promoter allowing for the expression of the gene or cDNA which is aclministered to 
an individual to be immunized. 

The present invention will be fully illustrated by the Examples described below, 
although the scope of the invention cannot in anyway be limited to these embodiments. 

EXAMPLES 

Example 1 : Genetic organization upstream from the M. tuberculosis orflC gene. 

To isolate potential promoter region, the inventors have cloned the l.lkb DNA 
sequence upstream from the M. tuberculosis orflC gene. A 150 bp DNA fragment 
covering the first half , of the orflC gene was obtained by digestion of the plasmid 
pAA249 with EcoKUPstl, radioactively labeled and used to probe a cosmid library of 
Mycobacterium tuberculosis strain H37Rv by colony hybridization (Sambrook et al., 
1989). A 1.1 kb Pstl restriction fragment shared by three strongly hybridizing cosmids, 
was transferred to pBluescript II KS + to give pEPX 61 (Fig. 2). Double-stranded DNA 
sequencing revealed perfect nucleotide identity between the 1069 bp insert of pIPX61 
and its counterpart in M. bovis RD1 (Maheiras et al., 1996). It included a 285bp open 
reading frame preceded by a potential ribosomal binding site (AGAGA) in the same 
transcriptional orientation as orflC (Fig. 5A). This ORF was designated lhp (L45 
homologous protein) since its deduced product shared 40% peptide identity with the 
M. leprae L45 seroreactive antigen (Fig. 5B). L45 seroreactive antigen is strongly 
recognized by sera from lepromatous leprosy patients (Sathish et al., 1990) but its 
function is currently unknown, lhp was not annotated in the M. bovis RD 1 sequence 
publis hed by Maheiras et al. CMaheiras et al 1996) and overlans with thr T ™a nf th~ 
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To investigate promoter activity, the inventors have constructed translational 
fusions between orflC, Ihp and the lacZ reporter gene. orflC and Ihp were inserted 
into promoter probe vectors of the pJEM series (Timm et al., 1994), out- or in frame 
with regard to lacZ. The resulting plasmids were named pIPX15, pIPX16 and pIPX46, 
5 pIPX47 respectively (Fig. 2). These constructs were introduced by electroporation in 
M smegmatis mc^ 155 and §-galactosidase activity was assayed in bacterial cell 
extracts. Strong §-galactosidase activity was detected in extracts of cells carrying in 
frame fusions (pIPX16 and pIPX47) but not in extracts of cells carrying out-of-frame 
fusions (pIPX15 and pIPX46). Thus (i) Ihp is expressed and translated (ii) there is a 

10 mycobacterial promoter activity somewhere in the 900 bp upstream from the orflC 
start codon. The differences in levels of §-galactosidase activity produced from 
pIPX16 and pIPX47 may reflect differences in the stability of LacZ fusion proteins. 
Alternatively, this may be attributable to differences in the efficiency of Ihp and orflC 

} il translation signals. In that respect, a long (A+G)-rich stretch upstream from the orflC 

12} ATG and overlapping the predicted ribosomal binding site may potentially alter the 

% \ translation of orfl C-lacZ. 

r: Example 3 : Mapping of the Ihp/orflC promoter activity. 

The inventors have performed primer extension experiments to map the site(s) 
20: of Ihp/orflC transcription initiation. Total RNA was extracted (Bashyam et al., 1994) 
h- from M tuberculosis and M smegmatis mc^lSS transformed with pIPX16. By 
M| walking upstream from the orflC start codon, we identified one major and two 
secondary transcriptional start sites in M. tuberculosis (Fig. 6A). The sites are close 
together within a:' region of 30 bp about 430 nucleotides upstream from the orflC ATG 
25 start codon. The (A+T) rich [TAATGA] region may correspond to the -10 hexamer 
motif identified in promoters of other bacterial genera. The corresponding -35 region 
contains two tandem repeats of a 18 nucleotide motif extending from positions -25 to - 
60 (Fig. 6B). The significance of this organization is unknown but may serve a 
regulatory function (Collado-Vides et al., 1991). The positions of the transcriptional 
30 start sites detected in M. tuberculosis are consistent with Ihp and orflC being 
cotranscribed. 

Surprisingly, the transcription start sites detected with RNA extracted from M 
smegmatis mc 2 155 [pIPX16] differed from those in M tuberculosis (Fig. 6C). The 
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smegmatis. To test this, a 480bp DNA fragment encompassing the transcription start 
sites identified in M. tuberculosis was inserted into the vector pJEM15, creating a 
transcriptional fusion with lacZ. The resulting plasmid (pIPX18) was introduced into 
M. bovis BCG and lacZ activity was detected by the appearence of blue colonies on 
7H10 X-Gal indicator plates. In contrast, no activity was observed with colonies of M. 
smegmatis mc 2 155 [pIPX18] grown on the same plates. The absence of significant §- 
galactosidase activity in mc 2 155 [pIPX18] was confirmed by standard §-galactosidase 
assays (Fig. 2). This data suggests that correct expression of Ihp-orflC required 
factor(s) absent from or not functional in M. smegmatis. 

Example 4 : Ihp /orflC are organized as an operon. 

To confirm that Ihp and orflC belong to the same transcriptional unit, the 
inventors used the ESA-A probe (see Fig. 2) for Northern blotting hybridization with 
total M. tuberculosis RNA. A strong hybridization signal migrating at about 800bp 
(estimated using RNA molecular weight standards) was detected (Figure 7, lanes 1 and 
2). Moreover, comparable amounts of this transcript were detected in early (day 5) and 
late (day 16) cultures (Figure 7, lanes 3 to 6). Consequently the Ihp-orflC transcript 
appears to be produced in M. tuberculosis from a constitutive high level promoter, 
and/or alternatively, is very stable. In view of the position of the +1 sites, a transcript 
of 800 bp covers both Ihp and orflC Furthermore, there is a structure similar to a 
Rho-independent transcription terminator, 790 bp dowstream from the Ihp /orflC 
major transcriptional start site. 

Example 5 : Construction of the pIPX30 expression/tagging vector 

The design of the pIPX30 was carried out in two steps. First, we constructed a 
small sized ^co/z-mycobacteria shuttle plasmid harbouring a convenient multiple 
cloning site and a selectable marker gene confering resistance to kanamycine. The PstI 
fragment from pUC4K together with the NdeVBsal-digested pUC18 were blunted with 
phage T4 polymerase (Amersham) and ligated to each other. The resulting plasmid 
designated pPV8 was digested by StuI and ligated to the EcoKV/Hpal fragment from 
pALSOOO to give pPV24. The pPV24 plasmid is a multipurpose shuttle cloning vector 
harbouring four unique restriction sites (Kpnl, BamW, Xbal, PstI). It allows alpha 
complementation and blue/white selection of recombinants in E. coli. 
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the two ovelaping oligonucleotides XL1 0 and XL2 0- Since the promoter and linker 
fragments carried a 12 bp overlaping region, they were recombined in vitro by PCR 
amplification using the XP1 and XL2 oligonucleotides. This PCR-tailored fragment 
was inserted in pPV24 previously digested with KpriUPstl and blunted with T4 
polymerase, resulting in the plasmid pIPX30. As presented figure 1, the pIPX30 
expression cassette is composed of (i) a Shine Dalgarno motif functional in 
mycobacteria (ii) a translation initiation codon followed by three unique cloning sites, 
allowing gene fusions with a stretch of DNA coding for six histidine (iii) two 
translational termination codons and the ESAT-6 transcription terminator. 
2.2. pIPX30 promotes high level expression of §-galactosidase in mycobacteria 

To characterize this novel expression vector, a truncated lacZ reporter gene, 
generated by AspHS/Pstl digestion of promoter-probe plasmid pJEM13, was inserted 
into the corresponding sites of pIPX30. In the resulting plasmid designated pIPX34, 
the lacZ gene is in frame with regard to the pIPX30 ATG initiation codon. When 
introduced into M. smegmatis mc2 155, pIPX34 produced a high level of §- 
galactosidase activity (Fig. 2). This level of §-galactosidase activity is comparable to 
the one obtained in pJN30 extracts, where lacZ is under the control of the strong 
pBlaF* promoter of Mycobacterium fortuitum. When transformed in Mycobacterium 
bovis BCG, the pIPX34 construct resulted in dark blue colonies in presence of the §- 
galactosidase X-gal chromogenic substrate. These observations indicated that the 
combination of promoter/expression cassette used in pIPX30 is functional in 
representative members of fast- and slow-growing mycobacteria. 

Example 6 : Expression and immunodetection of DES (His6) in M. smegmatis 

To validate pIPX30 as an expression/tagging vector, we expressed in this 
system the DES antigen of M tuberculosis . The des gene was recently cloned from 
Mycobacterium tuberculosis and encodes DES, a protein sharing conserved motifs 
caracteristic of the class II diiron-oxoprotein family. DES is putative A-9 (delta 9 ) 
desaturase and could potentially be involved in the biosynthesis of mycobacterial 
lipids and mycolic acids. Moreover, DES is strongly recognized by sera from 
tuberculosis patients and represent a potential diagnostic reagent. To express DES in a 
mycobacterial context, the model M. smegmatis was chosen as a host because it is 

inAnmiK anH r^rt hp m-own to hmb cpII Hensitv fnn to 10& PFTT/ml^ in nvpr-niaht hroth 
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AGTTTGCC-3') were used to amplify by PCR the DES coding region cloned in 
plasmid pBS-des. The resulting PCR fragment was digested with BamHl and Kpn\ and 
cloned into the corresponding sites of pIPX30 to give pIPX30-DES. Protein extracts 
corresponding to the bacterial cell sonicate were prepared from M. smegmatis 
harbouring pIPX30 or pIPX30-DES, and analyzed by Western blotting using anti-DES 
mouse polyclonal serum. As reported figure 3 A, a protein band migrating at about 38 
kDa, was detected specifically in M smegmatis transformed with pIPX34 plasmid but 
not in extracts corresponding to the pIPX30 control vector. An additional 36 kDa band 
detected in both protein extracts, was attributed to the endogenous M. smegmatis DES 
protein or alternatively to a molecule cross reacting with the anti-DES mouse serum. 
The same results (Fig. 3B) were obtained with a commercially available monoclonal 
antibody directed against the (His)6 peptide, supporting the presence of six histidine at 
the carboxy terminus of DES. 

Example 7 : Identification of LHP polypeptide in short term culture filtrate (ST- 
CF). 

ST-CF was produced as previously described (Andersen et al., 1991). Briefly, 
M tuberculosis (8 x 10 6 CFU/ml) were grown in modified Sauton medium on an 
orbital shaker for 7 days. The culture supernatants were sterile-filtrated and 
concentrated on an Amicon YM3 membrane (Amicon, Danvers, MA). The ORFX 
protein was purified from ST-CF by preparative SDS-PAGE using the Prepcell system 
(BioRad, Richmond, CA). 1 ml containing 8 mg of ST-CF was applied on a matrix of 
16% polyacrylamide and separation was performed under an electrical gradient for 22 
hours. 3 ml fractions were collected and analyzed on silverstained SDS-PAGE. 3 ml 
of the fractions containing the ORFX protein was concentrated in the presence of 0. 1 
SDS in a Centricon-3 unit (Amicon) followed by acetone precipitatation. The 
precipitate was redissolved in Tricine SDS-PAGE sample buffer and loaded on a 
precast 10-20 % Tricine SDS-PAGE gel (Novex, San Diego, USA). After 
electrophoresis the gel was blotted to Problott PVDF membrane (Applied Biosystems, 
Foster City, CA) by semidry electroblotting in 10 mM CAPS, 10 % methanol, pH 1 1. 
The PVDF membrane was stained with 0. 1 % Coomassie R-250 in 40 % methanol, 1 
% acetid acid, and destained in 50 % methanol. The band of interest was excised and 
subjected to N-terrninal sequence analysis by automated Edman degradation using a 
Procise 494 sequencer (Applied Biosystems) as described by the manufacturer. 
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A-E-M-K-T-D-A-A-T-L-X-Q-E-A-G, wherein X represents any amino acid, said 
sequence corresponding to the N-terminal sequence of LHP, the methionine residue 
located at the Nm-terminal position having been naturally removed by the bacterial 
enzymatic machinery. 

5 

As it appears from the teachings of the specification, the invention is not limited 
in scope to one or several of the above detailed embodiments; the present invention 
also embraces all the alternatives that can be performed by one skilled in the same 
technical field, without deviating from the subject or from the scope of the instant 
10 invention. 
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### DNA Strider™ 1.2 m credi 2$ Juin 1997 J17:0J:42 

esat-6 am (+) 8/5/96 -> Full Restriction Map 

DNA sequence 1069 b.p. CTGCAGCMCTC ... aacgcgctgcag linear 
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318 
318 
318 
318 
318 



HinP I 
Bha I 



Mbo II 
Bbs I 




349 
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Mnl 
Mbo II 
Ear I 

I 



Sag T 

HqlA r 
SCU36 I 
flapl28fi I 
Rfln TT 
Hph I 
Bsr I Alu I 



Bar I 
I 

AGGACGACTGGGACGAAGAOGAOGACTGGTGAGCTCCCGTAATGACAACAGAC 



ScrF I 
Nci I 
Msp I 
Hp* II 
Dsa V 

Xtna T 
On* T 

ScrF I 
Bspw i 

Bgi I Msp I 
Sfi I Bpa II 

Bae III Bae III 
Gdi II Sau96 I 
Eae I BstX I 
Nci I 

Ben I 
Osa v 
BalK i 
BsaJ I 
Ben I 



M3p I 
Bpa II 

ScrF I 

Nci I 

Dsa V 

BstX I 



Ben I 



Mbo 
Bbs 



II 
I 



Ava I 
II M 

^CCCGCOCCCGAAGACTT 



TCCTGCTGACCCTGCTTCTCCTOCTGACCACTCGAGGGCATTACTGTTGTCTGAA<M 



480 



I 

407 



I I 
415 
415 



418 



I I Ml 
425 432 
4 28 

431 
431 
431 
431 
431 



III! 
456 
456 
456 
4 56 
456 
457 
457 
458 
458 
459 
459 
4 60 
460 



II II 

464 

464 

464 

464 

4 64 

465 
464 

465 
467 



I 

473 
473 



468 
4 70 
4 70 



464 

m i 464 

464 

EH 465 
j:i 465 

465 
465 
465 

HinP I 

Li Hha I 

. j Mnl I 

- J EcoN I 

L;, BspW I BspW I 

Uflafv T Mr» i t , Kbo II Fnu4R I BstO I 

W ^ a L * 1 ^ 1 Nla III Bbs I SfaN I Bair I 

^^^r 1 "™ 30 ^ 560 
^CCGTTGT AAAACCGC IXJCTf CCATTTCI C 1 1, 1 i i'LATCAGCTCGTACCCTCTCTA C IIVAXA^VACCOCCATCGGAQCGC 

4fl2 ioc * * * 1 * I * I I I • III* 

482 495 524 536 



543 



BspM 
Mnl I 
I I 



Taq 

I 



Msp I 
Hpa II 

Sau3A I 

Mbo I 

Dpn II 

Bstr i 

Alw I 
Dpn I 
II I 



536 



ScrF I 
EcoR II 
Osa V 
BstN I 
BatX I 
Taq I 

SauJA I 

Mbo I 

Dpn II 

Dpn I 



554 



546 



549 



557 
558 



554 
554 



558 
558 



I I 



BceF I 
Taq I 
Sal T 
Bine II 

Arr T 

Pie I 

Binf I BspM 
III I I 



GTCtrrCCCTrCCATTAAAGCTCGCCTAGAGGCCGCrCGACT^ 

I I I'll I * * III •III | i 

564 577 584 608 621 

567 583 60S 621 

583 608 623 



640 



630 
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Bar I 
Bspw x 
Hae III 
Sau96 I 
till 



BatO I 
HinP I 
HinP I Fau I 
flha I 
Fnu4H I 
BatO I Bspw I 
Hha I Fnu4B I Beef* 
II III I III! 



ScrF r 
ECOR* II 
BsaJ I 
BapW I 
Fnu4B I BstCJ I 
Bapw I sac II 
Bgl I NspB II 

Sfi I osa I 

Mrr T Dsa V 
Gdi II BsaJ I 

t BstN I 
Eae I BstK I 
BfllR T FDU4B I 

Hae III Hae III 

i Bsir 1 
IM III! 



HinP I Fnu4B I 

Bha I Bbv 1 

GCA<XGCCA<TTGttGCGGCQCGCCG03te 



720 



I I I I ■ 
644 
64 5 
646 
647 



III III 1 


1 1 II 1 


1 1 


mi 


653 660 


669 


677 




654 663 


671 




680 


655 


670 




681 


658 


670 


677 




653 663 


670 


677 




656 


670 




682 


659 


670 


677 






671 




682 




672 




662 




672 




682 




672 




683 



I 

692 
692 



I 

70S 
70S 



675 
676 
677 
677 

Hga I Hae III 

Aha II Sau96 I 

Msp I Mnl I 

Hpa II Taq I 

Nae I xho T 

CfrlO I Pa*»R7 T 

Sao T Hae III Ava I Mnl 

I III II I I I I I I . , , , 

: AGCAGGAACTCGACGAQVTCTCGACGAATATTCCrrCAGQC 
^CCTCCTTGAGCTGCTCTA(»CCTGCTTATAAGCACrC^^ 



Taq 

I 



Taq 
Sau3A I 
Mbo I 
Dpo II 

Bstr I 

Bol TT 
Dpo I 

I I 



Fnu4H I 
BspW I 
Fnu4H I Hae II 
Bbv I AlwN I 
I Bbv I 
II II 



800 



I 

730 



I I 
737 

736 

736 
737 
737 
737 



I 

74 7 



r*i BspW I 
: ^'lnP I 

vjlha I Mnl 

I I 



741 



III II 
758 
759 
759 
760 
760 
762 
763 



II I II- 
773 
773 
773 
774 
776 
778 
779 



I 

786 



I 

794 



I 



791 
791 



798 
800 



794 
794 



Fau I 



Nla III 
I 



Fnu4H I 
Bbv I 



CfcGCTCTTOCTCQCAAATCGQCTTCTCACC^ t ga caga gcagcag tg ga a t 

CGCCaCAGGAGCGTTTACCCGAAGACTCGGCGATTAT OCl ii 111 lU tXrTCCTTTTTCtactgtCtcgtCgtCacctta 



880 



I I 
801 
801 
803 



I 

808 



828 



I ■ 
859 



868 
868 



Fau I 
BstO I 
I I 



Fnu4H I 
BspW I 
Sac II 

NspB II ScrF I 

Dsa I EcoR II 

BaaJ I Dsa V 

FQU4B I Bst-N I 

Hae III BatLK I 

Mnl I BinP I 

Taq I BatO I Bha I BaaJ I 
I I I I I I I I I 



Mae II 
Mae III 



EcoN I 
Bair I 
Mnl I Mnl 
I I I 



AJ.WN I 



ttcgcgggtatcgaggccgcggcaagcgcaatccagggaaatgccacgtccattcattccctccxtgacgaggggaagca 
aa gcgccca tagct ccggcgccgt t cgcg t taggtccct ttacagtgcaggtaagt-aagggaggaactgctccccttcgt 



960 



I I 
883 
884 



• 1 1 1 1 1 1 1 • 

691 898 
893 
895 
896 
897 
897 
897 
897 



906 
906 



913 

913 
913 
913 
913 
913 



I 

923 



926 



I I 
940 

943 
943 



950 



959 
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ScrF I 
EcoR II 
Dsa V 
BstN I 
BSUC I 
Hae III 
NspB II 
Fnu4H I 
Bbv I BsaJ I 
Alu I Fnu4H I 



Page 5 



1040 




1038 
1038 
1040 



I If II 

cggctaccgagctgaacaacgcgctgcag 
i-gccgatggctcgacttgttgcgcgacgtc 
I II II 



1069 



1050 



1060 
1061 
1061 
1063 
1063 
1064 
1064 



Restriction Endonucleases site usage 



^Aat II 
~A.cc I 
>Afl II 
£ Afl III 

^ge I 
pAha II 

>lu I 
'Iftlw I 
^AlwN I 
Apa I 
Apa£ I 
Ase I 
Asp718 
Ava I 
Ava II 
Avr II 
BamH I 
Ban I 
Baa II 
Bbe I 
Bbs I 
Bbv I 
BceF I 
Bd I 

Ben I 
Bgl i 
Bgl II 
Bsa I 
BsaA I 
BsaB I 
BsaJ I 
Bsg I 
BsiE I 



10 



BssH II 
BstB I 
BstE II 
BstK I 
BstN I 
BstEJ I 
BstJC I 
Bstr I 
Bsu36 I 
CfrlO I 
Cla I 
Cap 6 I 
Ode I 
Dpn I 
Dpn II 
Dra I 
Dra III 
Drd. I 



Da a 
Dsa 
Eae 
Eag 
Ear 



10 



I 
v 
I 
I 
I 

EC1136 I 
BC04 7 III 
Eco57 I 
EcoK I 
ECCO109 I 
EcoR I 
EcoR II 
EcoR V 
Ehe I 
Esd I 



14 
9 

14 
1 
2 



3 
14 

2 
1 
1 
1 



Hga I 
HgiA I 
Hha I 
Hinc II 
Hind III 
Hinf I 
HinP I 
Hpo I 
Bpa II 
I 
I 
I 



Bph 
Kas 
Kpn 



Mae II 
Mae III 
Hfco I 
Mbo II 
Mcr 
Mlu 
Mme 
Mnl 



Msc 
Mse 
Map 
Hae 
Nar 
Nci 
Woo 
Mde 
Nhe 



Nla III 
Nla IV 
Not I 

Nru I 



2 

1 
17 
2 

2 
17 

14 

3 
1 

2 
2 
6 
5 
1 



14 



14 
3 
1 
5 



Pml I 
PpuM I 
PshA I 
Pst I 
Pvu I 
Pvu II 
Rma I 
Rsa I 
Rsr II 
Sac I 
Sac II 
Sal I 

Sap I 

Sau3A I 

Sau96 I 

Sea I 

ScrF I 

SfaN I 

Sfe I 

Sfi I 

SgrA I 

Sma I 

Soa I 

SoaB I 

Spe I 

Sph I 

Spl I 

Sse6337 ] 

Ssp I 

Stu I 
Sty I 
Swa I 
Tac I 



2 

1 

1 

1 
2 
1 

6 
8 

14 

2 
2 
2 
1 
1 
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Aat II 
Acc I 
Ava II 
Ban II 
Bbe I 
Bgl II 
BalE I 
BsmA I 
Bspl286 
BsaH II 
BstX I 
Csp6 I 
Ode I 
Eag I 
Ear I 
Ecll36 I 
ECO0109 
Ehe I 
Fok I 
Fsp I 
HgiA I 
Kaa I 
Mcr I 
Nar I 
PaeR7 I 
PflM I 
Pvu II 
Raa I 
Sac I 
f»i I 
I&grA I 
^£ma r 

l?p 1 
n&io i 

t>4na I 

4dwN I 
lAva I 

,-Bstr i 

'--Eae I 
II 

Hae II 

^Hga I 
ffldnc II 
,= Kinr I 

: hfae II 
^Mae III 
; Nla III 
(Pile I 

Pat I 
M&c II 
*<SiCaN I 

See I 

sfi I 

xcm I 

Aiw r 
Bha I 
BceF I 
Dsa I 
EcoN I 
Hph I 
Nae I 

Aha II 

Alu I 

Baa I 

Bgl I 

BspM I 

Bar I 

Ben I 



Site 




gaegt/c 




gt/mJcac 




g/gwee 




grgcy/c 




ggegc/c 




a/gatct 




cgry/cg 




gtctc 


1/5 


I gdgch/c 




g/cgege 




ccannnnn/ntgg 


g/tac 




c/taag 




c/ggecg 




ctcttc 


1/4 


gag/etc 




I rg/gnccy 




ggc/gee 




ggatg 


9/13 


tge/gea 




gwgew/c 





Ose Site position (Fragment length) Fragment order 



g/gegee 
c/grycg 
gg/cgee 
c/tcgag 
ccannnn/ntgg 
cag/ctg 
gt/ac 
gagct/c 
g/tcgac 
cr/ceggyg 
ccc/ggg 
aat/att 
c/tcgag 
c/ceggg 

cagnnn/ctg 
c/yegrg 
r/ga tcy 

y/ggcer 
yggccg -5/-1 
rgegc/y 

gaege 5/10 
gty/rac 
g/antc 
a/cgt 
/gtnac 
catg/ 

gagtc 4/5 
ctgea/g 
cegc/gg 
gcatc 5/9 
c/tryag 
ggcen nnn/nggee 
ccannnnn/nnnntgg 



K 11) 2 
1( 622) 1 
1( 300) 2 
1( 430) 2 
1( 285) 2 
K 735) 1 
1( 669) 1 
1( 212) 2 
1( 430) 2 
1( 158) 2 
1( 481) 2 
1( 1008) 1 
1( 214) 2 
1( 669) 1 
1( 414) 2 
1( 430) 2 
K 131) 2 
1( 285) 2 
K HI) 2 
K 323) 2 
1( 430) 2 
1( 285) 2 
1( 669) 1 
1( 285) 2 
1( 772) 1 
1( 110) 2 
1( 216) 2 
1( 1008) 1 
1( 430) 2 
1( 622) 1 
K 38) 2 
1{ 463) 2 
1( 746) 1 
K 772) 1 
1( 463) 2 



K 

K 

K 

1( 

K 

K 

K 

K 

K 

i< 

K 

K 

K . 

1( 

K 

K 

K 

K 

if 



797) 1 
463) 1 
582) 1 
457) 1 
457) 1 
285) 
762) 
191) 
165) 
12) 
9) 
523) 
165) 
0) 
681) 
248) 
0) 
458) 
110) 



12( 1058) 1 
623( 447) 2 
301( 769) 1 
431( 639) 1 
286 ( 784) 1 
736( 334) 2 
670( 400) 2 
213{ 857) 1 
431< 639) 1 
159< 911) 1 
482( 588) 1 
1009< 61) 2 
215( 855) 1 
670( 400) 2 
415( 655) 1 
431( 639) 1 
132( 938) 1 
286( 784) 1 
112( 958) 1 
324( 746) 1 
431( 639) 1 
286( 784) 1 
670( 400) 2 
286( 784) 1 
773( 297) 2 
111( 959) 1 
217( 853) 1 
1009( 61) 2 
431( 639) 1 
623( 447) 2 
39( 1031) 1 
464( 606) 1 
747( 323) 2 
773( 297) 2 
464( 606) 1 



798( 
464( 
583( 
458( 
458( 
286 ( 
763( 
192( 
166{ 
13( 
10( 
524( 
166( 
K 
682( 
249( 
K 
459( 
UK 



161) 
309) 
153) 
212) 
212) 
514) 
271) 
431) 
455) 
913) 
913) 
335) 
455) 
1063) 
215) 
294) 
1063) 
212) 
233) 



2 
2 
3 
3 
3 
1 
2 
2 
1 
1 
1 
2 

1 

1 

2 

2 

1 

3 

2 



9S9( 
773 ( 
736( 
670 ( 
670( 
800 ( 
1034 ( 
623( 
621( 
926( 
923 ( 
859( 
621( 
1064 ( 
897( 
543( 
1064 ( 
671( 
344 ( 



111) 
297) 
334 ) 
400) 
400) 
270) 
36) 
447) 
449) 
144) 
147) 
211) 
449) 
6) 
173) 
527) 
6) 
399) 
726) 



3 

3 

2 

2 

2 

3 

3 

1 

2 

2 

2 

3 

2 

2 

3 

1 

2 

2 

1 



ggatc 4/5 2 


K 


126) 


4 


127( 


141) 


3 


268 < 


315) 


3 


583{ 


487) 


1 


gaagac 2/6 1 


K 


390) 


2 


391( 


82) 


3 


473( 


63) 


4 


536( 


534) 


1 


aegge 12/13 2 


K 


626) 


1 


627( 


42) 


3 


669 ( 


371) 


2 


1040( 


30) 


4 


c/crygg 3 


K 


681) 


1 


682( 


215) 


2 


897 { 


141) 


3 


1038( 


32) 


4 


cctnn/nnnagg 3 


1( 


174) 


3 


175( 


379) 


2 


554 ( 


389) 


1 


943( 


127) 


4 


ggtga 8/7 3 


1( 


8) 


4 


9( 


237) 


2 


246( 


182) 


3 


428( 


642) 


1 


gec/gge 3 


K 


251) 


3 


252( 


111) 


4 


363( 


396) 


1 


759( 


311) 


2 


gr/cgyc 4 


K 


11) 


5 


12( 


274) 


2 


286 ( 


476) 


1 


762( 


272) 


3 




1034 ( 


36) 


4 




















ag/ct 4 


K 


217) 


2 


218( 


214) 


3 


432( 


540) 


1 


972( 


78) 


4 


g/gyrcc 4 


1050( 


20) 


5 




















K 


36) 


5 


37( 


54) 


4 


9K 


195) 


2 


286( 


80) 


3 




366 ( 


704 ) 


1 














geennon/ogge 4 


K 


25) 


5 


26 ( 


57) 


4 


83C 


377) 


2 


460< 


212) 


3 




672( 


398) 


1 




















acctgc 4/8 4 


K 


5) 


5 


6( 


175) 


3 


181( 


386) 


2 


567( 


63) 


4 




630( 


440) 


1 




















actgg 1/-1 4 


1( 


94) 


4 


95( 


312) 


2 


407( 


19) 


5 


425( 


222) 


3 




64 7 { 


423) 


1 




















ccs/gg 5 


1 ( 


303) 


2 


304 ( 


45) 


4 


349( 


107) 


3 


456( 


8) 


5 
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Fau I 
Mbo I 
Sau3A I 



cccgc 
/gate 
/gate 



4/6 



584 ( 

K 
663( 

K 
584 ( 

K 
584 ( 



24) 7 
141) 4 
165) 3 
127) 4 

24) 7 
127) 4 

24) 7 



608( 
142( 
828{ 
128{ 
608( 
128 ( 
608 ( 



129) 
6) 
56) 
94) 

129) 
94) 

129) 



737(, 333) 1 

148( 49) 6 

884 ( 186) 2 

222 ( 47) 6 

737 ( 333) 1 

222 ( 47) 6 

737( 333) 1 
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197( 466) 1 
269 ( 315) 2 
269( 315) 2 



Nla rv 


ggn/ncc 


7 


K 


36) 


6 


37( 


54) 


3 


9K 


42) 


4 


133( 


153) 2 








286 ( 


14) 


8 


300( 


37) 


5 


337 ( 


29) 


7 


366( 


704) 1 


NspB XX 


cmg/cJcg 


7 


K 


99) 


3 


100 ( 


15) 


7 


115 < 


90) 


5 


205 ( 


12) 8 








217 ( 


465) 


1 


682( 


215) 


2 


897( 


ei) 


6 


978( 


92) 4 


BsiY I 


ccannnn/nngg 


8 


K 


110) 


5 


111C 


4) 


8 


115 ( 


20) 


7 


135( 


1) 9 








136( 


39) 


6 


175C 


379) 


1 


554 ( 


123) 


4 


677( 


266) 2 








943( 


127) 


3 














Sau96 I 


g/gncc 


8 


K 


80) 


6 


81C 


52) 


7 


133 < 


17) 


8 


150( 


151) 4 








301( 


14) 


9 


315( 


152) 


3 


467( 


177) 


2 


644 ( 


134) 5 








778( 


292) 


1 














BSLN I 


cc/wgg 


9 


K 


26) 


9 


27 ( 


148) 


3 


175 ( 


143) 


4 


318( 


26)10 








344 { 


270) 


1 


614( 


63) 


6 


677 ( 


236) 


2 


913( 


71) 5 








984 ( 


28) 


8 


1012( 


58) 


7 












EC OR II 


/ccwgg 


9 


K 


26) 


9 


27 ( 


148) 


3 


175( 


143) 


4 


318( 


26)10 






344( 


270) 


1 


614 ( 


63) 


6 


677( 


236) 


2 


913( 


71) 5 








984 ( 


28) 


8 


1012{ 


58) 


7 












Taq I 


t/cga 


9 


K 


105) 


6 


106( 


118) 


3 


224 ( 


353) 


1 


577{ 


33) 7 








610( 


14) 


9 


624( 


106) 


5 


730 ( 


11)10 


741( 


33) 8 








774 ( 


117) 


4 


891( 


179) 


2 













iBkaJ I c/enngg 



< B$ck I c/cngg 



• ; &stLI I cg/cg 
f Dsa. v /ccogg 



, Hae III gg/cc 



-flpa II c/egg 



Knl I cctc 7/7 14 



Msp I c/egg 



ScrF I cc/ngg 



10 


1( 


2)11 


3( 


153) 2 


156 ( 


103) 4 


259( 


3) 9 




262( 


443) 1 


705( 


86) 5 


791 ( 


3)10 


794 ( 


74) 7 




868 ( 


109) 3 


977{ 


86) 6 


1063( 


7) 8 






10 


K 


24 1) 1 


242( 


76) 5 


318 ( 


146) 4 


464{ 


212) 3 




676( 


6)11 


682( 


215) 2 


897< 


16)10 


913( 


71) 6 




984 ( 


28) 8 


1012( 


26) 9 


1038( 


32) 7 






14 


K 


26)10 


27 ( 


148) 3 


175 ( 


129) 4 


304 ( 


14)12 




318< 


26)11 


344 ( 


5)14 


349 < 


107) 5 


4S6( 


8)13 




4 64( 


1)15 


465( 


149) 2 


614 ( 


63) 7 


677( 


236) 1 




913( 


71) 6 


984 ( 


28) 9 


1012( 


58) 8 






14 


K 


71) 6 


72< 


69) 7 


141( 


6)13 


147( 


13)11 




160( 


2)15 


162( 


198) 2 


360 ( 


15) 9 


375< 


182) 3 




557( 


97) 5 


654 ( 


5)14 


659( 


24) 8 


683< 


200) 1 




883( 


15)10 


898( 


162) 4 


1060 ( 


10)12 






14 


K 


26)10 


27 ( 


148) 3 


175 ( 


129) 4 


304 ( 


14)12 




318( 


26)11 


344 ( 


5)14 


349{ 


107) 5 


4S6( 


8)13 




464 ( 


1)15 


46S< 


149) 2 


614 ( 


63) 7 


677( 


236} 1 




913( 


71) 6 


984 ( 


28) 9 


1012( 


58) 8 




165) 2 


14 


K 


81) 7 


82( 


51) 9 


133( 


18)13 


151( 




316( 


31)10 


347( 


112) 4 


459( 


9)14 


468( 


177) 1 




64 5 ( 


. 26)11 


67l( 


9)15 


680 ( 


78) 8 


758< 


21)12 




779 ( 


116) 3 


895C 


87) 6 


982( 


68) 5 




13)12 


14 


K 


40) 6 


41( 


191) 2 


23 2 ( 


21) 9 


253 ( 




266( 


29) 7 


295C 


9)13 


304 ( 


28) 8 


332( 


17)10 




349( 


15)11 


364 ( 


93) 5 


457( 


8)14 


465( 


5)15 




470( 


119) 4 


589( 


171) 3 


760 ( 


310) 1 


400( 


18)12 


14 


K 


63) 7 


64 ( 


178) 2 


242( 


158) 3 




418( 


77) 5 


495( 


59) 8 


554 ( 


10)13 


564< 


212) 1 




776{ 


10)14 


786( 


22)11 


808 ( 


85) 4 


893 ( 


47)10 




940( 


10)15 


950( 


54) 9 


1004 ( 


66) 6 


253 ( 


13)12 


14 


K 


40) 6 


41( 


191) 2 


232( 


21) 9 




266 ( 


29) 7 


29S( 


9)13 


304 ( 


28) 8 


332( 


17)10 




349( 


15)11 


364 ( 


93) 5 


457( 


8)14 


465( 


5)15 




470( 


119) 4 


589( 


171) 3 


760( 


310) 1 


304 ( 


14)12 


14 


K 


26)10 


27 ( 


148) 3 


175( 


129) 4 


318( 


26)11 


344 ( 


5)14 


349 ( 


107) 5 


456( 


8)13 




464( 


1)15 


465( 


149) 2 


614 ( 


63) 7 


677< 


236) 1 




913( 


71) 6 


984 ( 


28) 9 


1012( 


58) 8 







Hha I gcg/c 



HinP I g/ege 



17 



17 



K 
159( 
325( 
653( 
906 ( 

K 
159( 
325( 
653( 
906 < 



72) 7 

2)18 
36)10 

5)17 
155) 2 
72) 7 

2)18 
36)10 

5)17 
155) 2 



73( 
161( 
361( 
658( 

1061( 
73( 
161{ 
361( 
658< 

1061( 



67) 8 
18)12 
15)13 
34)11 

9)15 
67) 8 
18)12 
15)13 
34)11 

9)15 



140( 6)16 

179 ( 108) 4 

376( 182) 1 

692( 109) 3 

140{ 6)16 

179< 108) 4 

376( 182) 1 

692( 109) 3 



146( 13)14 

287( 38) 9 

558( 95) 6 

801< 105) 5 

146{ 13)14 

287( 38) 9 

558( 95) 6 

801C 105) 5 
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c3at-6 am (+) 8/5/96 -> Full Rf tion Map 

977 ( 3)26 980(' 83) 5 I063( 7)19 

435 sites found 

No Sites found for the following Restriction Endooucleases 



25/06^7 17:03:42 



Page 8 



Afl II 
Afl III 
Age I 
Apa I 
ApaL I 
Ase I 

Asp718 
Avr II 
BarnH I 
Bel I 
Bsa I 
BsaA I 
BsaB I 
Bsg I 
Bsm I 
Bspl20 I 
BspE I 
BspH I 
BstB I 
Sst£ II 
Bsu36 I 
Cla I 
Dra I 

pflra III 

MDrd I 



1/5 



c/ttaag 
a/crygt 
a/ceggt 
gggec/c 
g/tgeae 
a t/taa t 
g/gtacc 
c/ctagg 
g/gatcc 
t/gatca 
ggtctc 
yac/gtr 
gatnn/nnatc 
gtgcag 16/14 
gaatgc 
g/ggece 
t/cegga 
t/catga 
tt/cgaa 
g/gtnacc 
cc/tnagg 
at/cgat 
ttt/aaa 
caennn/gtg 
gaennnn/nngtc 



Eco4 7 III 
EC057 I 
EcoR I 
EcoR V 
Esp I 
Fse I 
Gsu I 
Hae I 
Bind III 
Hpa I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 



Kpa 
Kiu 
Hne 

HSC 

Hse 
Nco 
Nde 
Nhe 
Not 
Nru 
Nsi 
Hsp 



Nsp7524 
HspC I 



agc/gct 
ctgaag 16/14 
g/aattc 
gat/a tc 
gc/tnagc 
ggccgg/cc 
ctggag 16/14 
wgg/ccw 
a/agctt 
gtt/aac 
ggtac/c 
a/cgegt 
tccrac 20/18 
tgg/cca 
t/taa 
c/catgg 
ca/tatg 
g/ctagc 
gc/ggccgc 
tcg/cga 
atgea/t 
rcatg/y 
r/catgy 
rcatg/y 



Pac I 
Pml I 
PpuM I 
PahA I 
Pvu I 
Rma I 
Rsr II 
Sap I 
Sea I 
Sna I 
SnaB I 
Spe I 

Sph I 

Spl I 

Sse8337 

Stu I 

Sty I 

Swa I 

Tfi I 

Tthlll ] 

Ttnlll ] 

xba I 

Xca I 

Xran I 



ttaa t/taa 

cac/gtg 

rg/gwecy 

gaenn/nngtc 

cgat/cg 

c/tag 

cg/gwecg 

gcttcttc 1/4 

agt/act 

gta/tac 

tac/gta 

a/ctagt 

gcatg/c 

c/gtacg 

ectgea/gg 

agg/cct 

c/cwwgg 

attt/aaat 

g/awtc 

gacn/nogtc 

caarca 11/9 

t/ctaga 

gta/tac 

gaann/nnttc 



A 
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ScrF I 
EcoR II 
Dsa v 
3s cN r 
3s cX I 
Rsa I 

Csp6 I M^ e II 
Nla rv Aha ri 
X3n T -ar rr 

3an I Hph I 

Asa 7 1 3 Ma« m 

I I II t f 



Hpa II 

Serf I CCrlO I 

EcoR II Sctta r 
Dsa V Bspw i 

ascN I Mia IV HiriP I 

3scX I Ban I Hha I 

Sspw I Fnu4H I Fnu4H I Fnu4H I 

Bgl I SspW I Fnu4H I KrU r aszU I Sau96 

M | If II | | t t III | 

^^O^COC^CCCAGCCCACC^GCUACCCCCOC^CATC 80 



ggaaccAGgTCAC C T CC L 1U XUIA GCCAG wwww 



/ I 
9 

8 



2 

2 ~ * 



r:5 



12 



I I 

25 

25 
26 
26 
26 
26 
26 



I 

34 
34 



I I I J 



43 



3o 
35 
37 
33 
39 
40 
40 



46 



I 

63 



69 



I 

71 



72 
72 



I 

80 



Bspw 

acik: 

Hae^j: 

t f 



FoJc 



NspB II 



*V Taq I asiY 

Sa--. I Nspa II 3*iY I 

E?fET^- ^^Sgggag !CCOc mLiAACCA-nr cuc ioc^ io:iu^ tcagocccc*gc^ : <1c l uli j 

CCCG^CGAGCCCrrCC It-AGGCGACA GC nU^TA CGCCACCGACCACCTACTC^ 



5au3A 
t-Oo I 
Dpn II 
Dpn I 
Alw I 
I I It 



Sau96 I 
Nla IV 
Hae III 



Fau 



c ascu i 
HinP I 

Hha I 2scU I 
3siY I Kir.P I 
Hair I Hha I 

II M I 1:1111 



3spw I 
Kae III 
SauS6 I 
au I 



HinP I 
Hha I 
ascU I 

HinP I 

Kha I 



~nu4H 
35v I 

I I 1.1 



160 



I I 

81 
82 
82 



99 



94 



105 



r ( 
110 

110 
110 

111 



114 



114 



I 

126 
127 
127 
127 
127 



I I 



I f 
134 
135 



I I I 



3scU I 
I I 

CCCCGACTCCC7ACCT 



BspM 

HinP I 
Hha I 
ScrF I 
£coR II 
ScoN I 
Dsa v 
33CN I 
3SCK I 
3siY I 



131 
132 
132 
132 



139 
139 
140 

141 



III (If 
145 
14 = 
146 
147 
149 
150 
151 



I III 

155 

155 

158 
158 
158 
159 
160 
160 



Taq 
Sau3A 
Mbo I 
Alu I 

FMi rr 
Nspa ii 



NspB II Me I Dpn 
Fnu4H I 3sznA I Dpn 

I ' I ll I I I I r | 

GCGOrrr-Art^-"^ -^ f-'C^C C +^ ^ i IV^CCCCAaXXGCTt^ltTTCTCACCT^ I'iU-CC 240 

, , "^^^?0i^COCCTCCACCCACCAACTOCGCC7a :a ^ . . . 1LUX XAACGGC 

1 ' ' • ' I -II l I f l 'll || 



MSp 

Hpa 
CfrlO 
I 



I 

II 



161 



174 



191 



203 



212 



221 



230 
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Seq Insert pIPX26 -> Full Restrict.. Map 19:13:21 Page 



rnu4H r 

'Hin? I ScrF r 

Hha I Nci I _^ 

ma rv hsp r sc=^ 1 



Fnu4H I 
BspW I 



£ II Hae IX Sau96 I |scN I 

w„h r £L ri Alwl r tua rv SsaJ I 

Mrl f S a* T Ms^I ^ III SaTl Msp I BacK I Hae III 

BsaJ I CCrlOI Hp! II HphI Atall Hpa II ^ Ben Sau9SI 

J ' ' ' * 26S * 275 " 8S' * 94 " 303 214 

"! »L 111 m 101 "J« 



24S 252 267 

251 258 

252 2S8 

255 268 

255 268 



285 300 317 

295 300 317 

285 303 317 

285 303 317 

285 303 317 

286 303 

286 303 
238 



scrF I 
Nci I 
Msp I 
Hpa II 
Dsa V 

9s cx r 

Ben I 

Xma I BspW 1 

Sma I Ml* ^ 

ScrF I 34X1 1 

Nci'l 1 

Dsa V Hpa II 



3s cK I Naa I 

Hi aspw i asiY i cfrio r 

^ Mia IV BsaJ" I SspW I 

:au4H I 3cn I KinP I KinP l 

w*r, r iva I KTxa I Hha I »s - 

r; II Xczn I B«U I ( f BSCCJ I Hbo II «nl 

^ ■ ' " • J . " * IL 11 M ' J71 * 387 399 



400 



331 343 359 



331 347 



360 37S 390 

-334 347 - 360 37S 390 

336 347 362 

337 347 362 

347 362 
347 363 
347 363 
347 365 
347 365 
347 366 
348 
348 
348 
348 
348 
348 
348 
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„ . -. vr ' 8/05/96 19:18:21 Pa S e 3 

Stq Insert pIPX26 -> Full Restriction Map 

scrr I 
Nci I - 

MSp I 

Hpa II 

Dsa V 
Xrrva I 
Sma I 
Serf r 

BspW I 

Bgl I Msp I 
SCi I Hpa II 

Hae III Hae III 
Cdi II Sau9S I 
Eae I BscK I 
Msp I Nci I 
Hpa II 3cn I 



g^TnTfi r *** i Dsa v 

r " Dsa V BsaJ I 



"Ci 1 ^n 3 ?" 3scK I Ben I Mbo II 

Mbo II hph I Ben I Ava I 3bs I 

Bsr I Ear_I asr I Alu (|||| n , , 

1 1 1 - ^ I ,,-J.,~ T *l l/vTCCCGT v^TCACAAO^GA i,! l ^ CCCGC CACOCGGOCOGSA^A^ACTro 480 

I • I I • I I 'I , * 4S5 463 472 

406 414 424 431 4SS 4fi3 4 72 

414 - 427 >,n 455 463 



417 430 
430 
430 
430 
430 



455 463 

455 463 

456 464 

456 463 

457 464 
457 4S6 



f"*i 458 467 

^ 458 469 

U 459 469 

m 459 

463 

ftj 463 

463 
464 

{j ! 464 
TV 464 

464 
464 

Mnl I 

r i EcoN I 

asiY I 

^ Hbo II Fnu4H I Ava I 

Nla III Bbs I S£aN I 3s*-T I 



<^atttt^^ 5 °° 

C36gTCTAAAACCGCTCCTTCCX ri - lL 11 II 1 i ICA TCAC^rCCTACXu iu ic I Ac i ^ 1V ^ L . , , 

K.l ' * * 523 " 535 542 553 

481 494 5 " l 2 - 545 554 " 



553 
553 
553 



H3p I 

"?* 11 BscK I Taq I 



Serf I 
EcoR II 
Dsa V 

BscN I BceF I 



^l^ 1 raql^I Sff\ BS^I 

AciAcicAOTI^ 640 

cot 609 62* 

- -S53 5" 



59 



Scq Insert pIPX2<J -> Full Restriction Map 



3/05/96 19:18:21 Pa* 



asr I 

BspW I 

Hae III 
Sau96 I 
I I I I 



Fnu4H I 

aspw i 
agi r 
sci r 
mcx r 

Cdi II 
Sag I 
Sae I 

asiZ I 



ScrF I 
EcoR II 
3saJ I 
Bspw I 



Bsca r 

Sac II 
Nspfl II 

Osa I 

Osa V 

EsaJ I 
ascN i 

BscK I 

Fnu4H I 
HI 



3scU I 
Hin? I 
HinP I Fau 
Hha I 
Fnu4H I 

3scu I 3sp-:-; r Ha* III Hae 
Hha I Fnu4H * 3ceF I 3siY I 

III III I I II I III I II I 

CAGGGCCAGTCGCCraSCCGCCGC^^ ITalGCTTCCAACAACCAGQCAATAAGCAGAA 
GTCCCGCrrC*CCGCGCCGCCXCGCCCCri^C^ 



HinP I 
Hha I 

I 



Fnu4H I 

abv r 

i 



720 



1 1 1 1 


• III III- 1 


1 1 1 1 


III 1 1 1 1 -1 


643 


652 659 


668 


676 691 


644 


653 662 


670 


679 6S1 


645 


554 


669 


680 


646 


657 


669 


676 




652 662 


669 


676 




657 


669 


631 




658 


669 


576 






670 


681 






671 


681 






671 


681 






671 


682 



I 

704 
704 



Taq 
r 



Taq 
Sau3A I 
Mbo I 
Opn II 

3scY r 

3ai — II 



674 

. 675 
676 
676 



Dpn 

I I 



I 



CXSA^XAACTCCACGAGATCTCGACCAATATTCG 



H<ya I 
Aha II 

msp r 
Hpa II 
Nae I 
C£rl0 I 
Hae III 
Ml II 



Hae III 
Sau96 I 
Mill I 
Taq I 

Xho T 

o^«*Q7 T 



Ava I 

Mill 



Mnl 

I 



HinP I 

Fnu4H I 

3spW I 
Fnu4H I Hae II 
Sbv I AlwN I 
I 3£tv i Kha I 

II 111 



TACTCGAOOOCCGACGAOGAGCACCACCAOC 800 



Cu^i. i luACC llX.lUTAa\OCTCCrrATAACCAG L , CClA>CLCXI\<X7rrATGAGCI^^ 1LU 11U ILU 1U. 



729 



I I 
735 

735 

735 
736 
736 
735 



746 



Mnl 

I 

I 



740 



II I - M 
757 
753 
758 
759 
759 
761 
762 



I I 



Fau 

I 



I I I 
772 
772 
772 
773 
775 
777 
778 

Nla 

I 



I 

78S 



790 
790 



I 

753 



753 
753 



I I I 
800 

797 
799 



800 



III 



Fnu4H 
abv i 

I 



rcc^cAasAcccrrrrAccccAA^ ii v ii rri^LL m nrnuTACK, n iioiioi^ accttaa 



802 



807 



827 



I • 
8S8 



Fau I 

3scu r 
i i 



Fnu4H I 
BspW r 
Sac II 
NspB II 
Osa I 
BsaJ" I 
Fnu4H I 
Hae III 
Mni I 
Taq I BscU I 
1 I Mill 



I • 

867 

867 



HinP I 
Hha I 
I 



ScrF I 
EcoR II 
Osa V 
BSCN I 
BscK I 

BsaJ- I 

I 



Mae II 
Mae III 

I I 



EcoN 
BsiV 
Mnl I 
I I 



Mni I 



AlwN 

I 



960 



882 
393 



I I 

890 
S?2 
334 
395 
396 
896 
896 



I 

897 



905 
905 



I 

912 

912 
912 
912 
912 
912 



I I 
922 

925 



I • I 
939 

942 
942 



949 



I 

958 
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Seq insert pIPX26 -> Full Restrict!*,.. Map 
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Alu 
I 



Scrr 

zcor xx 

OSA v 
3s r 

35 CK I 
Hae XII 

Nspa ir 

Fnu4H I 
Bbv r 3saJ I 
I Fnu4K I 
t I t I I 



Mnl 
I 



Serf I 
EcoR II' 
Osa V 
3s CM X 
2s cX I 
Rsa I 
Cspo I 
I 3saJ X 
t I 



Beef 
Osa X 
BsaJ X 
Hya I 
Ana IX 
I ( t 



TCCCTCIACCAACCTCGCAG^^ rGlC C ftCC^AAAATSGSACGCCAC L040 



971 



II I • I 
979 

975 
976 
977 



:CCSArCCCGCCATCGCXAAGCCTCCOCA TO: lUJLA O U^lUjlTm 'XOCCTGCOCTG 



983 



1003 



981 
983 
983 
933 
983 
983 



I • I 
1011 

1003 

1008 
1011 
1011 
1011 
1011 
1011 



I II- 

1033 
1033 

1037 
1037 
1039 



AlwN X 



Alu 
f 



Fnu4H 
Bbv I 
HinP I 
Hha I 
BscU I 
tin i 



BscU I 
HinP I 
Kha X 

ScrF X Msp I 

EcoR IX Sau3A X Hpa IX 

Osa v Mbo X CfrlO X 

ascN I Dpn XI aspw i 

3scK I Dpn I agl I 

I I I 



Mcr I 
BsiS I 
TaQ X 



GCC^CCCAGCTCAACAACOCarroCAG^ 1120 



CCC^ICCCTCGAi. i i^l'lUXrGACGTCTTOGACO 



1049 



I I I I I 
1059 
1060 
1060 
1062 
1062 
1063 



• I 
1071 
1071 
1071 
1071 
1071 



I I 



1 

1083 
1083 
1083 
1083 



I I 

1093 
1093 
1093 
1094 
1094 



1063 
1066 



107S 
1075 
107S 



3sr I 

w Mae rrr 

=Mfce II Fok 

r ; i i i 



BspW 
I 



3SCU 
I 



TCCGTEACOCAAOCTSaCTTC 
• I I 
1111 
1112 
1112 



Sau3A I 
Mbo X 
Dpn XI 

Dpn I 3siY I 

Alw r Taq I 

I I 



aL^GTCACraX^TTTTCGC^ 1200 
CCTir^ACTTlACCCTACAACX^ 



M 1 I f 
ii26 

":l 1129 



t 

1133 



I 

1153 



■ 1161 



SsraA 
9s* T 

I I 

TTCCCTCCCTCTCCCO 



HinP I 
Maa XI Hha X Cde 
I t I 

lTA CG 1TIU AC^GCA C1C11 AGA CI,TUV 



I 

1182 
1183 
1183 
1183 
1183 



Hae HI 
Mcr I 
Cdi II 
Sag I 
Eae I 
BsiE X 
Fnu4H 
III 



I 



119S 



1199 



5au3A J 
Mbo I 
Dpn IX 
Dpn I 

Alw I 
Nla IV 
Bscr I 




1280 



I I * 
1208 
1209 



I 

1232 



I 

1240 
1240 



I • I 
1247 

1251 



12S9 



I I I 

1263 
1264 
1264 
1264 
1264 
1264 
126S 



ATCCTccca 
III- 

1275 
1277 
1277 
1277 
1277 
1278 
1278 
1278 
1278 
1278 



Seq Insert pIPX26 -> Full Reitric^on Map 



Ail II 

Ail rrr 

Age I 
Aha II 
Alu I 
Aiw r 
AlwN I 
Apa r 
ApaL r 
Ase r 
Asp718 
Ava r 
Ava II 
Avr II 
3axnH I 
San I 
Han II 
=oe I 
=bs I 
E^v I 

3ceF I 

3cl I 

Ben I 

3*1 I 

3gl II 

3sa I 

3saA I 

3sa3 r 

3sa*J I 

3sg I 

5si£ I 

3siY I 

am i 

3SriA I 
35^120 I 
3^31295 I 
5SpE I 
35§H I 
-S?M I 
333^W I 
35* I 



ascE rr 

BSCX I 

bscn i 
4 Bsca I 

4 S3CX I 
6 BscY I 

3 9su36 I 
CSrlO I 
Cla I 
Cspfi I 

1 Dele I 

4 Dpn I 
1 Dpn II 

Ora I 

1 Ora III 

5 Ord I 
1 Dsa I 
1 Osa V 
3 Eaa I 
7 Eag I 
3 Ear I 

Ecll36 I 

6 Eco4 7 III 
S Eco57 I 

1 EcoN I 
1 EcoO109 I 
EcoR I 
EcoR II 
12 EcoR V 
Ehe I 
3 Esp I 
10 Fau I 
Fse I 

2 Fnu4H I 
Fok I 

1 Fsp I 
Cdi II 
Gsu I 

3 Hae I 
19 Hae II 

S Hae III 



Hha I 
IS Hinc II 
10 Hind III 
IS Hint I 

1 HinP I 
3 Hpa I 

Hpa II 
S Hph I 
Kas I 

2 Kpn I 

2 Mae II 
9 Mae III 
9 Mbo I 

Mbo II 
Mcr I 
Mlu I 

3 Mroe I 
IS Mnl I 

3 Msc I 

2 Mse I 

1 Ksp I 
1 Nae I 

Nar I 

Nci I 

3 Nco I 
1 Nde I 

Nhe I 
10 Nla III 
Nla IV 

1 NOC I 

Nru I 

6 . Ssi I 

Nsp I 
24 Nsp7S24 I 

2 NspS II 
NspC I 

3 Pac I 
PaeR7 I 
PflH I 

2 Pie I 
14 
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17 PshA I 

2 Psc r 

pvu, r 
2 Pvu ri 

17 Rma I 
Rsa I 

IS Rsr II 

4 Sac I 

I Sac II 

1 Sal I 

4 Sap I 

4 Sau3A I 
9 Sau9S I 

5 Sea I 

3 ScrF I 
SCaN I 
SCe I 
IS Sd I 
SgrA I 
Sma I 
15 Sna I 
3 SnaB I 
1 Spe I 
6 Sph I 
Spl I 
Sse8337 I 
Ssp I 
3 Scu I 
9 scy I 
Sua I 
Taq I 
Tffi I 
Tthlll I 
Ttiilll II 
7 Xba I 
xca I 
Xan I 
1 Xho I 

1 Xma I 

2 Xmn I 
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1 

1 

2 

1 
2 
1 

9 
3 

16 
2 
1 
2 
1 
2 



11 



Sice 



Use Sice position (Fracmenc lengch) Fragmenc order 



Aacjll 
Acc * I 
Asp.713 
Ava:.* II 
Bat&i I 

abe' t i 
agt.ill 
3sa I 
Bspl286 
EssH II 
3SCX I 
Ear I 
EC1136 I 
EcoO109 
"he I 
.giA I 
Kas I 
Kpn I 
Nar I 
PaeR7 i 
P£LN I 
Psc I 
Pvu ir 
I 
I 
I 



Sac 
Sal 
SCe 
SgrA 
Ssp I 
Xho I 



BsmA I 
Csp6 : 
Dde I 
Eag r 



gaegc/c 
g- /.nJcac 
g/gcacc 
g/gwee 
g/gaccc 
grgcy/c 
ggegc/c 
a/gaccc 
ggcccc 1/5 
I gdgch/c 
g/cgege 
c cannnnn / n c gg 
cccccc 1/4 
gag/ccc 
I rg/gnccy 
ggc/gee 
gwgew/c 
g/gegee 
ggcac/c 
gg/cgee 
c/ccgag 
ecannnn/negg 
cegca/g 
cag/ccg 
gagec/c 
g/ccgac 
c/ tryag 
cr/ceggyg 
aac/acc 
c/ccgag 



gcccc 
g/cac 
c/ cnag 
c/ggecg 



1/5 



1( 10) 2 
'1< 621) 2 
1 t 0)2 
1( 299) 2 
1 ( 1276) 1 
U 429) 2 
If 2S4) 2 
1< 734) 1 
If 1207) 1 
1( 429) 2 
It 1S7) 2 
If 430) 2 
K 413) 2 
If 429) 2 
If 130) 2 
If 284) 2 
If 429) 2 
If 284) 2 
1( 0) 2 

If 284) 2 
If 771) 1 
If 109) 2 
1( 1062) 1 
If 215) 2 
If 429) 2 
If 621) 2 
If 1062) 1 
If 37) 2 
If 745) 1 
If 771) 1 

K 211) 2 

If 1) 3 

If 213) 2 

If 653) 1 



11 ( 1272) 1 
622 f 661) 1 
If 1282) 1 
300( 983) 1 
1277( 6) 2 
430( 853) 1 
28S< 998) 1 
735f 548) 2 
1208 ( 75) 2 
430f 853) 1 
158 f 1125) 1 
481( 802) 1 
414{ 869) 1 
430( 853) 1 
13K 1152) 1 
28S( 998) 1 
430( 853) 1 
285f 998) 1 
K 1282) 1 
285( 998) 1 
772 { 511) 2 
110 ( 1173) 1 
1063 f 220) 2 
216 f 1067) 1 
430f 853) 1 
622 ( 661) 1 
1063 ( 220) 2 
38( 1245) 1 
746( 537) 2 
772f 511) 2 

212( 997) 1 

2 ( 10O6> 1 - 

2l4f 1033) 1 

669 f 595) 2 



1209f 74) 3 

1008f 275) 2 

I247< 36) 3 

1264( 19) 3 
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Seq Insert pIPX26 -> Full Restn^on Map 



&na r 
xcm r 
xma I 

AiwN r 
£bs I 
SceF r 

asiz r 

2spM r 

ascr i 

Dsa £ 

Eae I 
EcoN I 

ecu ri 
Mcr r 
Nae r 

ma rri 
Aha rr 

Aiu I 

Ava I 

Hph I 
Mae II 
Mae XII 



ccc/ggg 2 
ecannnnn/nnnnegg 2 

c/ceggg 2 

cagnnn/ccg 3 

gaagac 2/6 3 

aegge 12/13 3 

cgry/cg 3 

acccgc 4/8 3 

r/gaccy 3 

c/crygg 3 

y/ggcer 3 

ccenn/nnnagg 3 

ysgccg -5/-1 3 

c/grycg 3 

gec/gge 3 

cacg/ 3 

gr/cgyc 



ag/cc 
c/yegrg 
ggega a/7 
a/cgt 
/genac 



1 ( 
1( 
If 

1< 
K 
i( 
K 
1 ( 
1( 
1( 

If 

1( 

1( 

If 

If 

If 



346) 2 
109) 3 
346) 2 

796) 1 
389) 2 
625) 1 
668) 1 
179) 3 
SSI) 

680) 

4S6) 

173) 

456) 

668) 

250) 

522) 



1 
1 
2 

4 

2 

1 

3 

1 



4 


K 


10) 


5 




1033 { 


250) 


4 


4 


1( 


216) 


3 




1049 ( 


234) 


2 


4 


K 


346) 


2 




772 ( 


511) 


1 


4 


K 


7). 


5 




427 ( 


856) 


1 


4 


1( 


11) 


5 




1232 ( 


51) 


4 


4 


1( 


8) 


5 




U26< 


157} 


4 



Ban 


I 


5/gyrcc 


5 


1( 


0) 


6 










285C 


80) 


3 




r 


gc ennnn / nggc 


S 


1( 


24) 


6 


3sr : 








571 ( 


422) 


1 


r 


acegg 1/-1 


S 


K 


93) 


5 










646( 


483) 


1 




I I 


gaaga 8/7 


5 


If 


386) 


2 










472 ( 


63) 


3 



airy 1 
cfrlo I 



Nd! 1 



3&y. r 



ggacc 

ccs/gg 

r/ceggy 

cccgc 

cc/sgg 



gcagc 



4/5 



4/6 



8/12 



K 
1IS2 ( 
If 
455 ( 
1( 
362( 
If 
662 ( 
If 
455 f 



125) 4 



95) 
302) 
8) 
38) 
396) 
140) 
165) 
302) 
8) 



NspB II cmg/Ckg 
Sau9o I g/gncc 



If 154) 3 

793( 74) 7 

If 98) 4 

216( 465) 1 



347( 116) 3 
HOf 233) 2 
347{ 116) 3 
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797 f 
390( 
S26f 
669 f 
130 ( 
582 f 
681 ( 
457 f 
174( 
457 f 
559 ( 
251 ( 
523 f 



161) 3 
82) 3 
42) 4 
443) 2 
386) 2 
153) 3 
215) 3 
212) 3 
379) 2 
212) 3 
443) 2 
111) 4 
335) 3 



11 f 274) 2 

217( 214) 4 

347 ( 116) 4 

8f 237) 2 

12( 913) 1 

9f 267) 2 



If 
355 f 
2Sf 
1092 ( 
94( 
1129 ( 
387( 
53Sf 

126 ( 
1277 ( 

303 < 
463 f 

39 f 
758 f 
141( 
827 ( 
303 f 
463 f 



35) 5 
918) 1 

57) 5 
190) 4 
312) 2 
154) 4 

3) 6 
748) 1 

141) 3 
1) 7 
44) 4 
1) 
191) 
335) 
6) 
56) 
44) 
1) 



7 
3 
2 

7 

5 

4 

7 



463 f 920) 1 
143 ( . 940) 1 
46a< 820) 1 



953 ( 
472 ( 
663 ( 
1112 ( 
566 ( 
735 ( 
896 ( 
569 ( 
S53( 
669 ( 
1112 f 
362 ( 
853 ( 



108) 4 
63) 4 

371) 2 

152) 
63) 
542) 
141} 
595) 
389) 
595) 
152) 3 
396) 2 
401) 2 



3 
4 
2 
4 
1 
1 
1 



28S( 476) 1 
431 f 540) 1 



453 f 
245 ( 



91) 5 
30) 4 



1065 ( 


217) 2 


S3Sf 


748) 1 


1039( 


244) 3 


1264 ( 


19) 4 


629 ( 


654) 1 


1277 ( 


6) 4 


1037( 


246) 2 


1264 ( 


19) 4 


942( 


341) 3 


1264 { 


19) 4 


1264 f 


19) 4 


7S8( 


525) 1 


1259 ( 


24) 4 


761 ( 


272) 3 


971 { 


78) 5 


554 ( 


216) 3 


275 ( 


152) 3 



925 ( 199) 2 
276( 646) 1 



1124 ( 108) 3 
922 ( 204) 3 



36( 54) 4 90( 195) 2 

32( 377) 2 4S9( 212) 3 

405( 18) 6 424f 222) 3 

390f 24) 5 4l4f 58) 4 



257 ( 
1278( 
347{ 
464C 
230( 
1093 ( 
147( 
833f 
347 C 
464 f 



315) 2 
5) 6 
1) 
819) 
21) 
190) 
49) 
400) 
1) 



6 
1 
7 
4 
6 
2 
6 



819) 1 



Opn : 
opn rr 

Nla rv 
Sau3A r 

3sir : 

BSCN r 
ScoR II 



ga/ic 



/gacc 



/gacc 



ggn/ncc 



/gacc 



ccnnr.nr./ nngg 



/cc^g 



8 


If 


79) 6 


80 f 




300f 


14) 9 


314 f 




777 ( 


506) 1 




9 


If 


126) 4 


127f 




583 f 


24) 9 


507 ( 




1183 f 


95) 6 


1273 ( 


9 


If 


126) 4 


127( 




583 ( 


24) 9 


607 ( 




1183 ( 


95) 6 


1278f 


9 


If 


126} 4 


127 ( 




583 f 


24) 9 


607 ( 




1183 ( 


9S) 6 


1278 ( 


9 


If 


0)10 


If 




132 f 


1S3) 2 


28S( 




365 ( 


912) 1 


1277f 


9 


If 


126) 4 


127 ( 




583 ( 


24) 9 


607 ( 




1183 ( 


95) 6 


1278{ 


10 


If 


109) 6 


110( 




135( 


39) S 


174 ( 




676 f 


266) 1 


942( 


10 


K 


4)11 


5( 




317( 


296) 1 


613 f 




983 ( 


23) 9 


101K 


10 


If 


4)11 


Sf 




317( 


296) 1 


613 ( 




983 ( 


28) 9 


1011 f 



155 f 549) 1 

867 f 109) 4 

99f IS) 7 

681 f 215) 3 

52) 7 
152) 3 



94) 7 
129) 3 
5)10 
94) 7 
129) 3 
5) 10 
94) 7 
129) 3 

5) 10 
35) 6 
14) 8 

6) 9 
94) 7 

129) 3 
5)10 

4) 10 
173) 4 
257) 2 
21)10 
63) 7 
60) 3 
21)10 
63) 7 
fin i q 



704 ( 86) 5 

976( 86) 6 

114 ( 90) 5 

896f 81) 6 

132 ( 17) 8 

466( 177) 2 



582 { 600) 1 

348 C 107) 3 

2S1( 111) 5 

196 ( 466) 1 

348 { 107) 3 



790 ( 3) 8 

1062( 221) 2 

204 ( 12) 8 

977 ( 306} 2 

149 ( 151) 4 

643( 134) 5 



221f 
736 ( 

221( 
736f 

221 f 
736( 

36( 
299 ( 

221 ( 
736 ( 



47) 
347) 

47) 
347) 

47) 
347) 

54) 
37) 

47} 
347) 



8 
1 

8 
1 

8 
1 

3 
5 



268 f 
1083 ( 

263 < 
1083 ( 

268 ( 
1033 { 

90 ( 
336 { 

268{ 
1083 ( 



31S) 2 
100) 5 

315) 2 
100) 5 

315) 2 
100) 5 

42) 4 
29) 7 

315) 2 
100) 5 



114 ( 
347f 
1199f 
26 ( 
676 f 
1071 ( 
26( 
676 ( 



20) 9 

206) 3 

84) 7 

143) 4 

236) 2 

212) 3 

148) 4 

236) 2 



134 ( 
553 ( 

174 ( 
912 f 

174 { 
912 ( 



1)11 
123) S 

143) 5 
-*U 6 

143) 5 
71) 6 
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Seq Insert plPX26 -> Full Restriction Map 



Hae III 



3s IX I 



Dsa V 



L. ! 

s<z'£'t 







1037 ( 


246) ' 1 






gg/ cc 


14 


1 ( 


80) 7 


81( 


51) 9 






315( 


143) 4 


458 ( 


9)14 






670 ( 


9) 15 


679 ( 


78) 8 






894 ( 


87) 6 


981 ( 


284) 1 


eg /eg 


IS 


1 ( 


70) 7 


71 < 


69) 3 






159 ( 


2)16 


161 [ 


198) 3 






553 ( 


5)15 


658 ( 


24) 9 


c/egg 




897( 


162) 4 


1059 ( 


17) 10 


15 


1 ( 


39) 7 


40( 


191) 2 






26S< 


29) 3 


294 ( 


9) 14 






34S( 


15)12 


353 ( 


93) 6 






469 ( 


119) 5 


588 ( 


171) 4 


c/egg 


15 


1C 


39) 7 


40< 


191) 2 






265( 


29) 8 


294 ( 


9)14 






348( 


15)12 


363 ( 


93) 6 






469( 


119) 5 


588 ( 


171) 4 


c/cr.gg 


16 


H 


4)15 


5t 


21)12 






303 t 


14)13 


317( 


30)10 






4SS( 


8)14 


463 ( 


1)17 






676 ( 


236) 1 


912 ( 


71) 7 






1071 { 


212) 2 






/ccngg 


16 


1( 


4)15 


S( 


21)12 






303 ( 


14) 13 


317( 


30)10 






455C 


3)14 


463 ( 


1)17 






676 ( 


236) 1 


912 ( 


71) 7 






1071 ( 


212) 2 






cccc 


7/7 16 


K 


62) 7 


63 ( 


178) 3 






417( 


77) 6 


494 { 


59) 8 






775 { 


10)15 


78S( 


22)12 






939 ( 


10)16 


949 ( 


54) 9 






1275 ( 


8) 17 






cc/ngg 


15 


1 ( 


4) 15 


S( 


21)12 






303 ( 


14)13 


317 f 


30)10 






4SS( 


3) 14 


463 ( 


1)17 






676f 236) 1 


912 ( 


71) 7 






107l( 212) 2 







132 ( 18)12 

467{ 177) 2 

7S7( 21)11 

1265( 18)13 



140( 
359( 
682 ( 
1076 ( 
231( 
303 ( 
4So< 
759 ( 
231 < 
303 ( 
45£< 
7S9( 



6)14 
15)11 
200) 2 
85) 6 
21)10 
23) 9 
8)1S 
335) 1 
21)10 
23) 9 
8)15 
335) 1 



26 ( 148) 4 

347{ 1)16 

454 (' 149) 3 

983 ( 28)11 
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150 ( 165) 3 
644f 26)10 
778 f 116) 5 



146( 


13)13 


374 ( 


279) 1 


882 ( 


1SJ12 


L161C 


122) 5 


252 ( 


13)13 


331( 


17)11 


464 ( 


5)15 


1094< 


189) 3 


252 ( 


13)13 


33K 


17)11 


464 { 


5)15 


1094 ( 


189) 3 



174 ( 129) 5 

348( 107) 5 

613( S3) 9 

101K 60) 9 



26( 


148) 4 


174 ( 


129) 5 


347( 


1)16 


348( 


107) S 


464 ( 


149) 3 


613 ( 


63) 8 


983 ( 


28)11 


1011 ( 


60) 9 


241( 


153) 4 


399( 


18)13 


553 < 


10)14 


563 ( 


212) 2 


807 { 


85) 5 


892( 


47) 10 


1003 ( 


248) 1 


1251 < 


24) 11 


26 ( 


148) 4 


174 ( 


129) 5 


347( 


1)16 


348( 


107) 6 


464 ( 


149) 3 


613 < 


63) 8 


983 ( 


28)11 


1011( 


60) 9 



Hha 'Z 


ccg/c 17 


If 


71) 8 


72 ( 


67) 9 


139 ( 


6)16 


145 ( 


13) IS 






1S3{ 


2) 18 


160 ( 


18)12 


173 < 


1C3) 5 


286( 


74) 7 






360f 


15) 13 


375< 


277) 1 


652 { 


5)17 


657 ( 


34) 11 






691< 


109) 4 


800 ( 


10S) 6 


905 ( 


155) 3 


1060 ( 


15) 14 






1075 ( 


155) 2 


1240( 


43)10 






Hi a 3 I 


g/ege 17 


1( 


71) 8 


72 { 


67) 9 


139< 


6)16 


145( 


13) 15 






158( 


2) IS 


160 ( 


18)12 


173( 


103) 5 


286 ( 


74) 7 






360 [ 


15)13 


375 ( 


277) 1 


652 ( 


5)17 


657 ( 


34) 11 






691-C 


109) 4 


800 { 


10S) 6 


90S( 


155) 3 


1060 ( 


15)14 






1075 ( 


165) 2 


1240 ( 


43)10 








gcr--~^r-r-a / rungc 1 9 


K 


24)13 


25 ( 


9)15 


34( 


3)19 


37( 


45)11 






82C 


69) 9 


1SU 


104) 5 


255 ( 


82) 8 


337( 


25)12 






362 C 


4) 18 


366( 


93) 7 


459( 


186) 2 


645 ( 


17)14 






662 ( 


9)16 


671 ( 


3)20 


674 ( 


119) 4 


793 ( 


9)17 






802 ( 


96) 6 


898( 


195) 1 


1093 ( 


60) 10 


1153( 


130) 3 


F nu 4 K I 


gc/ngc 24 


1( 


33)12 


34 ( 


9)19 


43 ( 


3)22 


46< 


23)16 






69< 


86) 4 


155 ( 


48)10 


203 ( 


52) 9 


2S5< 


33)13 






288{ 


46)11 


334 C 


211) 1 


54S( 


109) 3 


654 ( 


5)21 






6S9( 


12) 18 


671 ( 


9)20 


680 ( 


24)15 


704 ( 


86) S 






790 ( 


3)23 


793 ( 


74) 8 


867 ( 


28)14 


895 ( 


3)24 






898 ( 


78) 7 


976( 


3)25 


979 ( 


83) 6 


1062C 


201) 2 






1263 f 


20) 17 
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no Sices found eor che following Restriction Endonucleases 



*n ir 
An in 

Age 1 
Apa I 
ApaL I 
Ase I 
Avr 11 
Bel I 

asAA r 
asae 1 

3sg I 



c/ z-aag 


EcoR r 


g/aaccc 


Pml r 


a/crygc 


EcoR V 


gac/acc 


ppuM r 


a/cegge 


Esp I 


gc/cnagc 


PshA r 


cggcc/c 


Fse I 


ggccgg/cc 


pvu r 


g/egcac 


Fsp I 


egc/gea 


Rma I 


a:/:aa; 


Gsu I 


ceggag 16/14 


Rsr II 


c/czagg 


Hae r 


wgg/ccw 


Sap I 


c/gacca 


Hind rn 


a/agece*" 


Sea r 


yac/g-r 


Hpa I 


gec/aac 


Sna I 


g*-"./nnacc 


Miu r 


a/cgege 


SnaB X 


gtgcag 16/14 


*tne r 


cccrae 20/13 


So« r 



cac/gcg 

rg/gwecy 

gaenn/nngee 

cgac/cg 

c/cag 

cg/gwecg 

gccccccc 1/4 

age /ace 

gca/cac 

cac/gca 

A/czaoz 
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Stq Insert pIPX26 -> Full Restriction Map 

Dra III c&carj\/gzg Nsp7324 I r/cacgy x£a I c/ctaga 

Ori I gacnrmn/nngcc NspC I rcacg/y Xca I gea/cae 

Hco47 III agc/gcc Pac I ccaac/caa Xmn r gaann/nnccc 
£co57 I ccgaag 15/14 
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nu rv 
;<an I 
3an I 

wsp : 
Hpa II 
3sir r 

Sau3A r Ana II Ms? I 

Hbo r t~t Kpa II 

Cpn EI Hpn r SctF I CirlO r 

opn : serf r ccor rr qr—i • 

Aiw r scor rr osa v 3spw r 

Nla rv Osa V t y^^ ~r 3s I N1a rv Hin? r 

ascr : asc* r ascx r 3an r h^a i 

SdfltH I 3scX Z 3apw r Fnu4H I F.TU4H I rr.u4H r 

Alw : Css^i r a*" rr 3^1 r 3spw r Frui4H I Mni : 3scU r 

1 1 I ( I I 1 1 II II I 1 1 1 1 1 I I I III 

S?a c ccg^caccACCTXlACGTCCrrTTT^ 80 
ccqa^gcca CgcrCCACTCCAOCAACAAGICGJICCACCCGCCCTt^^ JHX GTCGS l ilX. Ill 4 ' IICOCCC 

1 1 y : i 1 1 - 1 m 1 1 • -it 1 1 1 i / 1 1 • 1 . 1 . ; 11 

f*?i a 17 3; 4 0 4 3 59 77 

^ II 3L 40 52 73 

1- Q 11 12 42 73 

1. ;:1 H 13 32 42 73 

2- 1 2. 3 2 4 3 

2 ^ i4 3 2 4 5 
2 W L5 46 

5 

5 7 

cl 7 

SI 7 

u a 

~-_ sau9S r 

W": Mia rv 

Hae in Fau I 2spw r 

Nspa P ^" q r :-:a« rn 

Fo't r Sau3A r ascu r Sav:55 1 

3s P w i xcm I ?C» I Hin? r Fau I 

2 <?i ^ asr r Pfiv r cpn rr htia r ascc; : 

Hae :ir Nla rv Tac; r 3air I Cpn r Bjilf I HinP I 

5au5o r san r NspB ri asir i Aiw r asiY r Hha r 

111 < 1 1 1 ill 11 MM ill 111 tii 

CA<^Txx^crcrrccx^\co\crTCCG^ 1 6 0 

^CCSCACCAGCCGTUrniAGGa^ 
M 1 ■ ' I I -I III -II II II III -Ml II! 

86 95 L05 116 132 140 151 

87 96 111 120 133 141 151 

8 ! 100 116 L33 145 152 

38 115 133 145 1S3 

117 133 146 155 
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If 6) 2 
K 305) 2 
1( 435) 1 
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1( 574) 1 
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If 777) 1 
1 ( 115) 2 
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3( 848) 1 
436( 420) 2 
687{ 159) 2 
523 ( 228) 2 
44( 812) 1 
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PEPTIDESORT of: 



LHP.seq checJc: 6672 from: 1 to: 100 
from: 1 to: 100 



FROMSTAOEN of: LHP . txt check: 66 7 2 
< No Cootig Comments > 

with Enzymes: * 

June 26, 1997 11 : 54 
Digest with: Tryp. Peptides Sorted by Position 



Ret7 .4 
-7.3 



Chg 
0.0 



Pos From To Mol wt Ret 2.1 

1 1 - 5 608.8 9.9 
A1,E1,K1,H2 rso-6.44 Ext-0 

2 6-20 1579.6 5.7 -18.5 -2 0 1 
A3, D1,E2,F1,G2, LI, Ml, Q1,R1,T2 Iso-4 . 00 Ext-0 

3 21 - 26 631.7 18.1 15.2 0 0 0 
D1,G1,I1,K1,L1,S1 Iso-6.31 Ext-0 

4 27-44 2004.1 19.4 4.5 -10 1 
Al,Dl,El,G2,ri,Ll,Q4,Rl,S2,T2,Vl /Wa Iso-4 . 24 ' Ext-5690 
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5 45-57 1142.3 6.0 
A6,G2,Q1, R1,T1, V2 Iso-10 . 53 Ext-0 

6 58-64 806.9 5.4 
A2,E1,F1,K1,N1,Q1 Iso-6.44 Ext-0 

7 65-66 274.3 0.8 
K1,Q1 Iso-9.67 Ext-0 

8 67-77 1317.4 11.5 
Dl,E2,r2„Ll,Nl,Ql,Rl,Sl,Tl Iso-4 . 00 Ext-0 

9 78-85 908.0 4.2 1 1 
A1,G1,Q2,R1,S1, Vl,ri Iso-9.75 Ext-1280 

10 86 - 100 1668.8 1.6 -31.6 -3.0 

A2,D1,E2,F1,G1,L1,M1,Q4,S2 Iso-3.47 Ext-0 



6.5 



-8.8 



-5.3 



-6.9 



1.0 



0.0 



1.0 
-2.0 



1.0 



10 



Digest with: Tryp. Peptides Sorted by Weight 
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4 . 


5 
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Digest with: Tryp. Peptides Sorted by Retention 



Pos 
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To 


Mol Wt 


Ret2 


- 1 
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. 4 


Chg 
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11 . 
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Digest with: Chymo . Peptides Sorted by Position 



Pos 

1 



From 
1 



To 
18 



Mol Wt 
1885.1 



Ret2. 1 
21. 1 



Ret7 .4 
-9.1 



Chg 
-2.0 



Aro Acid Base Sulf ~Phil Phob 



A4,D1,E2, F1,G2,K1,L1,M2,N1,Q1,T2 Iso-4 . 00 Ext 



-2.0 



19 - 43 2747.0 23.5 3.4 

A1,D2,E2,G3,I2,K1,L2,Q4,R1,S3,T2,V1,W1 Iso-4 . 17 Ext-5690 
3 44-58 1445.6 15.4 20.5 2.0 



15 



10 



10 



11 
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A6,F1,G2,Q1,R2,T1,V2 Iso-12.50 Ext-0 

4 59-83 2862.1 -3.3 -33.3 -1.0 1 4 3 0 16 
A3,D1, E3,Gl,l2,lC2,Ll,N2,Q5,Rl,Sl,Tl,Vl,ri Iso-4 . 70 Ext-1280 

5 84 - 100 1912.0 -6.6 -29.6 -2.0 1 3 1 1 U 
A2,D1,E2,F1,G1,L1,M1,Q4/R1,S3 Iso-4 . 00 Ext-0 



Digest with: Chyrao . Peptides Sorted by Weight 
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Digest with: Chymo. Peptides Sorted by Retentioa 
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Digest with: Clos . Peptides Sorted by Position 



Pos From To Mol Wt Ret2,l Ret7.4 

1 1-20 2170.4 9.1 -25.2 
A4,D1, E3, F1,G2,K1,L1,M2,N1,Q1,R1,T2 Iso-4 

2 21-44 2617.8 31.0 20.3 
Al, D2,£l, G3, I2,K1,L2,Q4,R1,S3,T2, VI, Wl l3( 

3 45-57 1142.3 6.0 6.5 
A6 , G2,Q1, R1,T1, V2 Iso-10.53 Ext-0 

4 58-77 2362.6 4.7 -19.8 
A2 , Dl, E3 , Fl , 1 2 , K2 , LI , N2 , Q3 , Rl, SI, Tl Iso-4 

5 78-85 908.0 4.2 1.1 
A1,G1,Q2,R1,S1, VI, Yl ISO-9 . 75 EXt-1280 

6 86 - 100 1668.8 1.6 -31.6 
A2,D1,E2, F1,G1, L1,M1,Q4,S2 Iso-3.47 Ext-0 



Digest with: Clos. Peptides Sorted by Weight 
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Digest with: Clos. Peptides Sorted by Retention 
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Digest with: Myxo. Peptides Sorted by Weight 



Pos 


From 


To 


Mol Wt 


Ret2 


. 1 


Ret7 


.4 


Chg 


Aro 


Acid 


Base 


Sulf 


Phil 


Phob 


4 


65 - 


66 


274.3 


0 
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-5 
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0 


0 
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0 


2 
2 


0 
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1 - 
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608.8 


9 


9 


-7 


3 


0.0 
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3 
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26 


2193.3 


17 


3 


-2 


7 


-2.0 


1 


4 


2 


0 


11 
21 
18 


10 
13 
20 
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67 - 
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3858.1 


4 . 


3 


-36 


2 


-4.0 


2 


6 


2 


1 


3 


27 - 


64 


3917.3 


17. 


8 


3. 
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3 


0 
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65 
67 
1 
6 
27 



Digest with: Myxo . Peptides Sorted by Retention 



To Mol Wt 

66 274.3 

100 3858.1 

5 608.8 

26 2193.3 

64 3917.3 



Ret2. 1 
0.8 
4 . 3 
9.9 
17.3 
17.8 



Ret7.4 
-5.3 
-36.2 
-7.3 
-2.7 
3.4 



Chg 
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Acid Base 


Sulf 


Phil 


Phob 


1.0 
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0 1 
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-4.0 
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21 


13 
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1 1 
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-2.0 
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4 2 
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11 


10 


0.0 
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3 3 


0 


18 


20 
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Digest with: Staph. Peptides Sorted by Position 



0 
0 
1 
0 
2 



Pos From To Mol Wt Ret2.1 Ret7.4 Chg 

1 1 - 3 349.4 6.0 -12. 2 -1.0 
A1,E1,M1 I3O-3.90 Ext-0 

2 4-14 1164.3 9.9 -10.4 -1.0 
A2,D1, E1,G1,K1,L1,M1,Q1,T2 rso-4 . 24 Ext-0 

3 15-19 536.5 10.7 -4.6 -1.0 
Al, El, F1,C1,N1 Iso-3.90 Ext-0 

4 20-33 1601.8 14.9 0.6 -1.0 
D2, E1,G1, I2,K1,L1,Q2,R1,S1,T1, VI Iso-4 . 4 2 Ext-0 

5 34-60 2719.0 21.5 18.5 1.0 
A7, E1,F1,G4,L1,Q4,R2,S2,T2, V2, Wl Iso-10.38 Ext-5690 

6 61-68 916.0 -14.2 -26.7 1.0 0 
A2, El, K2, N1,Q2 Iao-9 . 51 Ext-0 

7 69-71 375.4 6.2 -16.9 -2.0 0 
01 , El , LI Iso-3.58 Ext-U) 

8 72-88 1908.0 9.1 8.2 0.0 1 
A2,Dl,El,Cl,I2,Wl,Q2,R2,S2,Tl,Vl,ri Iso-6.51 Ext-1280 

9 89-89 147.1 -1.0 -17.5 -1.0 0 
El Iso-3.90 Ext-0 

10 90 - 100 1224.4 19.5 9.9 0.0 1 
Al, F1,C1, L1,M1,Q4, S2 Iso-6.06 £xt-0 



Aro Acid Base Sulf Phil Phob 

2 



1 
2 
1 
3 
1 
1 
2 
2 
1 
0 



0 
1 
0 
2 
2 
2 
0 
2 
0 
0 



1 
1 
0 
0 
0 
0 
0 
0 
0 

1 



1 

6 
2 
9 
11 
6 
2 
10 
1 
6 



5 
3 
5 
16 
2 
1 

7 

0 

5 



Digest with: Staph. Peptides Sorted by Weight 



Pos 


From 




To 


Mol Wt 


Ret2 


- 1 


Ret7 


.4 


Chg 


Aro 


Acid 


Base 


Sulf 


Phil 


Phob 


9 


89 




89 


147 . 1 


-1 


. 0 


-17 


.5 


-1.0 


0 


1 


0 


0 


1 


0 


.1 


1 




3 


349 . 4 


6 


0 


-12 


. 2 


-1.0 


0 


1 


0 


1 


1 


2 


7 


69 




71 


375 . 4 


6 


2 


-16 


.9 


-2.0 


0 


2 


0 


0 


2 


1 


3 


15 




19 


536 . 5 


10 


7 


-4 


6 


-1.0 


1 


1 


0 


0 


2 


3 


6 


61 




68 


916 .0 


-14 


2 


-26 


7 


1.0 


0 


1 


2 


0 


6 


2 


2 


4 




14 


1164 . 3 


9 


9 


-10 


4 


-1.0 


0 


2 


1 


1 


6 


5 


10 


90 




100 


1224 .4 


19 . 


5 


9. 


9 


0.0 


1 


0 


0 


1 


6 


5 


4 


20 




33 


1601.8 


14 . 


9 


0. 


6 


-1.0 


0 


3 


2 


0 


9 


5 


8 


72 




88 


1908 .0 


9 . 


1 


8. 


2 


0.0 


1 


2 


2 


0 


10 


7 


5 


34 




60 


2719 .0 


21. 


5 


18. 


5 


1.0 


2 


1 


2 


0 


11 


16 



Digest with: Staph. Peptides Sorted by Retention 



Pos 


From 


To 


Mol Wt 


Ret2.1 


Ret7 


4 


Chg 


Aro 


Acid 


Base 


Sulf 


Phil 


Phob 


6 


61 - 


68 


916 .0 


-14 . 2 


-26 


7 


1.0 


0 


1 


2 


0 


6 


2 


9 


89 - 


39 


14 7 I 


-I 0 


-17 


•5 


-1 0 


0 




0 


0 


] 


0 



75 



fxavtorfe^cteopatre.pastetirJr 2g/0OT7 



apaort 



Pos From To Mol wt 


Ret2. 1 


Ret 7 .4 


Chg 


Aro 


Acid 


Base 


Sulf 


Phil 


Phob 


1 1-1 149.2 


13 .6 


4.2. 


0.0 


0 


0 


0 


1 


0 


1 


Ml Iso-6.06 Ext-0 




















2 2-4 349.4 


6.0 


-12.2 


-1.0 


0 


1 


0 


1 


1 


2 


A1,E1,M1 ISO-3.90 Ext-0 




















3 5-98 10112.9 


4.1 


-52.1 


-4.0 


4 


13 


9 


1 


53 


41 


A15 , 05 , E8 , F2 , G8 , 14 , K4 , L5 , Ml , N3 , Q15 , R5 , S7 , T6 , V4 , Wl , X 1 


Iso- 


4.50 


Ext-6970 


4 99 - 100 222.2 


19.9 


12.6 


0.0 


1 


0 


0 


0 


0 


2 


F1,G1 Iso-6.06 Ext-0 





















Digest with: Cnflr. Peptides Sorted by Weight 



Pos 


From 


TO 


Mol Wt 


Ret2 


1 


Ret7 


4 


Chg 


Aro 


Acid 


Base 


suir 


Phil 


Phob 


1 


1 - 


1 


149 . 2 


13 


6 


4 


2 


0.0 


0 


0 


0 


l 


0 


1 


4 


99 - 


100 


222. 2 


19 


9 


12 


6 


0.0 


1 


0 


0 


0 


0 


2 


2 


2 - 


4 


349.4 


6 


0 


-12 


2 


-1.0 


0 


1 


0 


1 


1 


2 


3 


5 - 


98 


10112.9 


4 . 


1 


-52 


1 


-4.0 


4 


13 


9 


1 


53 


' 41 



Digest with: CnJBr. Peptides Sorted by Retention 



Pos 


From 


To 


Mol Wt 


Ret2 


1 


Ret7 


4 


Chg 


Aro 


Acid 


Base 


Sulf 


Phil 


Phob 


3 


5 - 


98 


10112.9 


4 


1 


-52 


1 


-4.0 


4 


13 


9 


1 


53 


41 


2 


2 - 


4 


349 .4 


6 


0 


-12 


2 


-1.0 


0 


1 


0 


1 


1 


2 


1 


1 - 


1 


149 . 2 


13 


6 


4 . 


2 


0.0 


0 


0 


0 


1 


0 


1 


4 


99 - 


100 


222. 2 


19 . 


9 


12. 


6 


0.0 


1 


0 


0 


0 


0 


2 



Digest with.: IBcO. Peptides Sorted by Position 

Pos From To Mol wt Ret2.1 Ret7.4 Chg Aro Acid Base Suif Phil Phob 

1 1-43 4614.0 38.1 -5.1 -4.0 2 7 3 2 23 20 
A5,D3, E4, F1,G5, I2,K2,L3,M2, N1,Q5,R1,S3,T4, VI, Wl Iso-4 . 04 Ext-5690 

2 44 - 100 6183.7 -7.5 -41.2 -1.0 3 7 6 1 31 26 
All, D2, E5, F2,G4 ,I2,K2,L2,Ml,N2,Q10,R4,S4,T2,V3,rl Iso-4 .97 Ext-1280 



Digest with: XBzO, Peptides Sorted by Weight 



Pos 
1 
2 



From 
1 
44 



TO Mol Wt 
43 4614.0 
100 6183.7 



Ret2 . 1 Ret7 . 4 Chg 
38.1 -5.1 -4.0 
-7.5 -41.2 -1.0 



Aro Acid Base Sulf Phil Phob 

2 7 3 2 23 20 

3 7 6 1 31 26 



Digest with: IBzO. Peptides Sorted by Retention 



Pos 
"2 
1 



From 
44 - 
1 - 



To Mol Wt 
100 6183.7 
43 4614.0 



Ret2.1 Ret7.4 Chg Aro Acid Base Sulf Phil Phob 
-7.5 -41.2 -1.0 3 7 6 1 31 26 

38.1 -5.1 -4.0 2 7 3 2 23 20 



Digest with: Myxo. Peptides Sorted by Position 



Pos From To Mol wt Ret2.1 Ret7.4 Chg Axo Acid Base 

1 1 - 5 608.8 9.9 -7.3 0.0 0 11 
A1,E1,K1,M2 ISO-6.44 Ext-0 

2 6-26 2193.3 17.3 -2.7 -2.0 14 2 
A3, D2, E2, F1,G3, II, Kl, L2,N1,Q1, R1,S1,T2 ISO-4 . 17 Ext-0 

3 ...27 - - 64 3917.3 17.8 3.4 0.0 2 3 3 
A9,D1,E2,F1, G4,I1,!U, LI, Nl , Q6 , R2, S2, T3 , V3 , Wl Iso-6.62 Ext-5690 

4 65-66 274.3 0.8 -5-3 1.0 0 0 1 
K1,Q1 Iso-9.67 Ext-0 

5 67 - 100 3858.1 4.3 -36.2 -4.0 2 6 2 
A3,D2, E4, F1,G2, 12, L2, Ml, N1,Q7, R2, S4,T1/ Vl,lfl Iso-3.92 EXt-1280 



Sulf 
2 



Phil 
2 



11 



18 



Phob 
3 



10 



21 



20 



13 



o 



o 



76 



fx avterta(5)<: loopalrB-pa3laur.tr 2g/06/9', ^apsort 



2 


4 - 


14 


1164 


. 3 


9 


9 


-10 


4 


-1 


0 


0 


2 


1" 


1 


6 


5 


3 


15 - 


19 


536 


5 


10 


7 


-4 


6 


-1 


0 


1 


1 


0 


0 


2 


3 


4 


20 - 


33 


1601 


8 


' 14 


9 


0 


6 


-1 


0 


0 


3 


2 


0 


9 


5 


10 


90 - 


100 


1224 


4 


19 


5 


9 


9 


0 


0 


1 


0 


0 


1 


6 


5 


5 


34 - 


60 


2719 


0 


21 


5 


18 


5 


1 


0 


2 


1 


2 


0 


11 


16 



Digest with: TrypK • Peptides sorted by Position 



Pos 

1 



From 
1 



Ret2. 1 

9.9 



Ret7 . 4 
-7.3 



Chg 

0.0 



To Mol wt 
5 608.8 
Ai,El,Kl,M2 Iso-6.44 Ext-0 

2 6 - 26 2193.3 17.3 -2.7 

A3,D2,E2,F1,G3,I1,R1,L2,N1,Q1,R1,S1,T2 Iso-4 . 17 Ext-0 

3 27-64 3917.3 17.8 3.4 
A9 

4 65-66 274.3 0.8 



Aro Acid Base Sulf Phil Phob 



-2.0 



0.0 



,01, E2,F1, C4,I1,K1,L1, Nl, Q6,R2,S2, T3,V3,W1 Iso-6.62 Exf 



-5.3 



1.0 



3 

•5690 
1 



11 



18 



10 



20 





K1,Q1 ISO- 


9.67 Ext-0 












21 


' 13 




5 


67 - 


100 3858.1 


4 . 3 


-36. 2 


-4.0 2 


6 2 


1 




A3, 02, E4, F1,C2, I2,L2,M1, 


N1,Q7,R2,S4,T1, V1,T1 Iso-3 . 


92 Ext-1280 












Digest with: 


TrypK • 


Peptides 


Sorted by 


Weight 










POS 


From 


TO Mol Wt 


Ret2 . 1 


Ret7.4 


Chg Aro 


Acid Base 


Sulf 


Phil 


Phob 




4 


65 - 


66 274.3 


0 . 8 


-5.3 


1.0 0 


0 1 


0 


2 


0 




1 


1 - 


5 608.8 


9 . 9 


-7 . 3 


0.0 0 


1 1 


2 


2 


3 




2 


6 - 


26 2193.3 


17. 3 


-2.7 


-2.0 1 


4 2 


0 


11 


10 




5 . 


67 - 


100 3858.1 


4 . 3 


-36.2 


-4.0 2 


6 2 


1 


21 


13 




3 


27 - 


64 3917.3 


17 .8 


3.4 


0.0 2 


3 3 


0 


18 


20 








Digest with: 


TrypK. 


Peptides 


Sorted by 


Retention 










Pos 


From 


TO Mol Wt 


Ret2. 1 


Ret7 . 4 


Chg Aro 


Acid Base 


sulf 


Phil 


Phob 


Q 


4 


65 - 


66 274.3 


0.8 


-5 . 3 


1.0 0 


0 1 


0 


2 


0 




5 


67 - 


100 3858.1 


4 . 3 


-36 . 2 


-4.0 2 


6 2 


1 


21 


13 




1 


1 - 


5 608.8 


9 . 9 


-7 . 3 


0.0 0 


1 1 


2 


2 


3 




2 


6 - 


26 2193.3 


17 . 3 


-2.7 


-2.0 1 


4 2 


0 


11 


10 


i;\ 


3 


27 - 


64 3917.3 


17.8 


3.4 


0.0 2 


3 3 


0 


18 


20 








Digest with: 


TrypR. 


Peptides 


Sorted by 


Position 









Pos 

1 

A4 

2 

Al 

3 



From To Mol Wt Ret2 . 1 Ret7.4 Chg 

1 - 20 2170.4 9.1 -23.2 -2.0 1 

-D1,E3,F1,G2,K1,L1,M2,N1,Q1,R1,T2 Iso-4 . 24 Ext-0 

21 - 44 2617.8 31.0 20.3 -1.0 1 

D2,E1,G3,I2,K1,L2,Q4,R1,S3,T2,V1,W1 Iso-4. 42 Ext 



Aro Acid Base Sulf Phil Phob 



45 - 57 1142.3 
A6 , G2,Q1, R1,T1, V2 Iso-10 

4 58-77 2362.6 
A2,D1, E3, Fl, I2,K2,L1,M2,Q3 

5 78-85 908.0 
A1,G1,Q2, R1,S1, VI, ri Iso-9 

6 86 - 100 1668.8 
A2 / D1 / E2, F1,G1,L1,M1,Q4,S2 



6.0 6.5 
53 Ext-0 

4.7 -19.8 
R1,S1,T1 Iso-4. 

4.2 1.1. 
75 Ext-1280 

1.6 -31.6 
rso-3.47 EXt-0 



1.0 

-1.0 1 
70 Ext-0 
1.0 1 

-3.0 1 



3 

■5690 
0 



10 



14 



14 



10 



10 



10 



Digest with: TrypR. Peptides Sorted by Weight 



Pos 
5 
3 
6 
1 

4 



From 
78 - 
45 - 
86 - 
1 - 
58 - 



TO Mol Wt 

85 908.0 

57 1142.3 

100 1668.8 

20 2170.4 

77 2362.6 



Ret2 


1 


Ret7 


4 


Chg 


Aro 


Acid 


Base 


Sulf 


Phil 


Phob 


4 


2 


1 


1 


1.0 


1 


...0 


.1 


0 


4 


4 


6 


0 


6 


5 


1.0 


0 


0 


1 


0 


3 


10 


1 


6 


-31 


6 


-3.0 


1 


3 


0 


1 


9 


6 


9 


1 


-25 


2 


-2.0 


1 


4 


2 


2 


10 


10 


4 


7 


-19 


8 


-1.0 


1 


4 


3 


0 


14 


6 






in 


7 


- 1 n 


i 


1 


i 


0 


14 


10 
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77 



Pos 
6 





Digest with: 


TrypR. 


Pepcid 


From 


To 


Mol Wt 


Ret2. 1 


*et7.4 


86 - 


100 


1668.8 


1.6 


-31.6 


78 - 


85 


908.0 


4 . 2 


1.1 


58 - 


77 


2362.6 


4 . 7 


-19.8 


45 - 


57 


114 2.3 


6.0 


6.5 


1 - 


20 


2170.4 


9.1 


-25. 2 


21 - 


44 


2617.8 


31.0 


20.3 



Peptides Sorted by Retention 



Chg Aro Acid Base Sulf Phil Phob 



-3.0 
1.0 

-1.0 
1.0 

-2.0 

-1.0 



1 
1 
1 
0 
1 
1 



3 
0 
4 
0 
4 
3 



0 


1 


9 


6 


1 


0 


4 


4 


3 


0 


14 


6 


1 


0 


3 


10 


2 


2 


10 


10 


2 


0 


14 


10 



Digest with: AspN. Peptides Sorted by Position 



Ret7.4 
-4.6 



Chg 
0.0 



-2.0 1 
00 EXt-0 
0.0 0 



Pos From To Mol wt 

1 1 - 6 709.9 
A1,E1,K1,M2,T1 Iso-6.44 

2 7-23 1735.8 
A3, Dl, E2, Fl, G3, II, LI, Nl,Ql 

3 24-29 716.8 
Dl , II, Kl, LI, Q1,T1 Iso-6.31 

4 30-69 4201.6 

A9,D1,E3, Fl, G4,K2,L2,Nl, Q7 , R2 , S2, T2 , V3 , Wl Iso-6 . 

5 70-86 1908 .0 9.1 8.2 0.0 
A2,D1,E1,G1,I2,N1,Q2,R2,S2,T1, Vl,ri Iso-6.51 Ext 

6 87 - 100 1597.7 1.7 -32.1 -3.0 
A1,D1, E2, F1,G1, L1,M1,Q4, S2 Iso-3.47 Ext-0 



Ret2 
11, 
EXt-0 

11-8 -6.1 
,R1,S1,T1 Iso-4 
21.3 11.9 
Ext-0 
1.3 -26.0 



Aro Acid Base Sulf Phil 



0.0 



69 Ext-5690 

1 2 
-1280 

1 3 



20 



10 



Phob 
3 



20 



Digest with: AspN. Peptides Sorted by Weight 



Pos 


From 


To 


Mol Wt 


Ret2 


. 1 


Ret7 


. 4 


Chg 


Aro 


Acid 


1 


1 - 


6 


709.9 


11 


. 4 


-4 


.6 


0.0 


0 


1 


3 


24 - 


29 


716 .8 


21 


3 


11 


.9 


0.0 


0 


1 


6 


87 - 


100 


1597.7 


1 


7 


-32 


1 


-3.0 


1 


3 


2 


7 - 


23 


1735 .8 


11 


8 


-6 


1 


-2.0 


1 


3 


5 


70 - 


86 


1908.0 


9. 


1 


8 


2 


0.0 


1 


2 


4 


30 - 


69 


.4201.6 


1. 


3 


-26 . 


0 


0.0 


2 


4 



1 
1 

0 

1 

2 
4 



2 
0 
1 
0 
0 
0 



4 2 

9 5 

8 9 

10 7 



20 



20 







Digest with: 


AspN 




Peptides 


Sorted 


by 


Retention 








Pos 


From 


To 


Mol Wt 


Ret2 


1 


Ret7 . 4 


Chg 


Aro 


Acid Base 


Sulf 


Phil 


Phob 


4 


30 - 


69 


4 201.6 


1 


3 


-26.0 


0.0 


2 


4 4 


0 


20 


20 


6 


87 - 


100 


1597 .7 


1 


7 


-32.1 


-3.0 


1 


3 0 


1 


9 


5 


5 


70 - 


86 


1908 .0 


9. 


1 


8.2 


0.0 


1 


2 2 


0 


10 


7 


1 


1 - 


6 


709.9 


11. 


4 


-4.6 


0.0 


0 


1 1 


2 


3 


3 


2 


7 - 


23 


1735.8 


11. 


8 


-6 . 1 


-2.0 


1 


3 1 


0 


8 


9 


3 


24 - 


29 


716.8 


21. 


3 


11.9 


0.0 


0 


1 1 


0 


4 


2 



Summary for whole sequence: 

Molecular weight - 10779.72 
Average Residue Weight - 107.797 
Isoelectric point - 4.41 
Extinction coefficient - 6970 



Residues - 
Charged 



100 
-5 



Residue 


Number 


Mole Percent 


A - Ala 


16 


16.000 


B - Asx 


0 


0.000 


C - Cys 


0 


0.000 


D - Asp — 


- 5 


5.000 


E - Glu 


9 


9 .000 


F - Phe 


3 


3 .000 


G - Glv 


o 


o nnn 



faaviert>@eleopatre.paat«ur.lr ^xpaort 



78 



L - 


Leu 






5 


5.000 


M - 


Met 






3 


3 .000 


M - 


Asa 






3 


3 .000 


P - 


Pro 






0 


0.000 


Q - 


GlQ 






15 


15.000 


R - 


Arg 






5 


5.000 


S - 


Ser 






7 


7.000 


T - 


'Du- 






6 


6 .000 


V - 


val 






4 


4 .000 


W - 


Trp 






1 


1.000 


r - 


Tyr 






1 


1.000 


z - 


GJLx 






0 


0.000 


A + 


G 






25 


25 .000 


S + 


T 






13 


13 .000 


0 + 


E 






14 


14 .000 


0 + 


E * 


N 


♦ Q 


32 


3 2.000 


H + 


K + 


R 




9 


9.000 


D + 


E + 


a 


+ K + R 


23 


23.000 


r + 


L + 


M 


+ V 


16 


16.000 


F + 


W + 


r 




5 


5.000 



Enzymes that do cut: 

Tryp Chyroo Clos Cnflr IBzo Myxo Staph 

TrypR AspK 

Enzymes that do not cut: 

NH20H NTCB pH2 . 5 ProEn NoCut 



79 



What is claimed is : 

1. A purified polynucleotide wherein said polynucleotide is choosen from the group 
consisting of : 

a) a polynucleotide comprising the following nucleotide sequence of SEQ ID N° 1 : 

CTGCAGCAGGTGACGTCGTTGTTCAGCCAGGTGGGCGGCACCGGCGGCGGC 
AACCCAGCCGACGAGGAAGCCGCGCAGATG 

GGCCTGCTCGGCACCAGTCCGCTGTCGAACCATCCGCTGGCTGGTGGATCA 
GGCCCCAGCGCGGGCGCGGGCCTGCTGCG 

CGCGGAGTCGCTACCTGGCGCAGGTGGGTCGTTGACCCGCACGCCGCTGAT 
GTCTCAGCTGATCGAAAAGCCGGTTGCCC 

CCTCGGTGATGCCGGCGGCTGTTGCCGGATCGTCGGTGACGGGTGGCGCCG 
CTCCGGTGGGTCCGGGAGCGATGGGCCAG 

GGTTCGCAATCCGGCGGCTCCACCAGCCCGGGTCTGGTCGCGCCGGCACCG 
CTCGCGCAGGAGCGTGAAGAAGACGACGA 

GGACGACTGGGACGAAGAGGACGACTGGTGAGCTCCCGTAATGACAACAG 
ACTTCCCGGCCACCCGGGCCGGAAGACTTG 

CCAACATTTTGGCGAGGAAGGTAAAGAGAGAAAGTAGTCCAGCATGGCAG 
AGATGAAGACCGATGCCGCTACCCTCGGGC 

AGGAGGCAGGTAATTTCGAGCGGATCTCCGGCGACCTGAAAACCCAGATCG 
ACCAGGTGGAGTCGACGGCAGGTTCGTTG 

CAGGGCCAGTGGCGCGGCGCGGCGGGGACGGCCGCCCAGGCCGCGGTGGT 
GCGCTTCCAAGAAGCAGCCAATAAGCAGAA 

GCAGGAACTCGACGAGATCTCGACGAATATTCGTCAGGCCGGCGTCCAATA 
CTCGAGGGCCGACGAGGAGCAGCAGCAGG 

CGCTGTCCTCGCAAATGGGCTTCTGACCCGCTAATACGAAAAGAAACGGAG 
CAAAAACATGACAGAGCAGCAGTGGAATT 

TCGCGGGTATCGAGGCCGCGGCAAGCGCAATCCAGGGAAATGTCACGTCCA 
TTC ATTC CCTCC TTGAC G AGGGG AAGC AG 

TCCCTGACCAAGCTCGCAGCGGCCTGGGGCGGTAGCGGTTCGGAGGCGTAC 
CAGGGTGTCCAGCAAAAATGGGACGCCAC 

GGCTACCGAGCTGAACAACGCGCTGCAGAACCTGGCGCGGACGATCAGCG 
AAGCCGGTCAGGCAATGGCTTCGACCGAAG 

GCAACGTCACTGGGATGTTCGCATAGGGCAACGCCGAGTTCGCGTAGAATA 
GCGAAACACGGGATCGGGCGAGTTCGACC 
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TTCCGTCGGTCTCGCCCTTTCTCGTGTTTATACGTTTGAGCGCACTCTGAGA 
GGTTGTCATGGCGGCCGACTACGA 

b) a polynucleotide comprising the following nucleotide sequence of SEQ ID N°2, 
starting at its 5 'end with the nucleotide in position 1 of SEQ ID N°l and ending at its 
3' end with the nucleotide in position 524 of SEQ ID N°l, or a biologically active 
polynucleotide derivative of SEQ ID N°2 : 

CTGCAGCAGGTGACGTCGTTGTTCAGCCAGGTGGGCGGCACCGGCGGCGGC 
AACCCAGCCGACGAGGAAGCCGCGCAGATG 

GGCCTGCTCGGCACCAGTCCGCTGTCGAACCATCCGCTGGCTGGTGGATCA 
GGCCCCAGCGCGGGCGCGGGCCTGCTGCG 

CGCGGAGTCGCTACCTGGCGCAGGTGGGTCGTTGACCCGCACGCCGCTGAT 
GTCTCAGCTGATCGAAAAGCCGGTTGCCC 

CCTCGGTGATGCCGGCGGCTGTTGCCGGATCGTCGGTGACGGGTGGCGGCG 
CTCCGGTGGGTCCGGGAGCGATGGGCCAG 

GGTTCGCAATCCGGCGGCTCCACCAGCCCGGGTCTGGTCGCGCCGGCACCG 
CTCGCGCAGGAGCGTGAAGAAGACGACGA 

GGACGACTGGGACGAAGAGGACGACTGGTGAGCTCCCGTAATGACAACAG 
ACTTCCCGGCCACCCGGGCCGGAAGACTTG 

CCAACATTTTGGCGAGGAAGGTAAAGAGAGAAAGTAGTCCAGC 

c) a polynucleotide comprising the following nucleotide sequence of SEQ ID N°3, 
starting at its 5' end with the nucleotide in position 1 of SEQ ID N°l and ending at its 
3' end with the nucleotide in position 481 of SEQ ID N°l, or a biologically active 
polynucleotide derivative of SEQ ID N°3 : 

CTGCAGCAGGTGACGTCGTTGTTCAGCCAGGTGGGCGGCACCGGCGGCGGC 
AACCCAGCCGACGAGGAAGCCGCGCAGATG 

GGCCTGCTCGGCACCAGTCCGCTGTCGAACCATCCGCTGGCTGGTGGATCA 
GGCCCCAGCGCGGGCGCGGGCCTGCTGCG 

CGCGGAGTCGCTACCTGGCGCAGGTGGGTCGTTGACCCGCACGCCGCTGAT 
GTCTCAGCTGATCGAAAAGCCGGTTGCCC 

CCTCGGTGATGCCGGCGGCTGTTGCCGGATCGTCGGTGACGGGTGGCGCCG 
CTCCGGTGGGTCCGGGAGCGATGGGCCAG 

GGTTCGCAATCCGGCGGCTCCACCAGCCCGGGTCTGGTCGCGCCGGCACCG 
CTCGCGCAGGAGCGTGAAGAAGACGACGA 

GGACGACTGGGACGAAGAGGACGACTGGTGAGCTCCCGTAATGACAACAG 
_A CTTCCCGOCC. A CCCGOCtCCOOAAOACTTO 
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d) a polynucleotide comprising the following nucleotide sequence of SEQ ID N°4, 
starting at its 5' end with the nucleotide in position 525 of SEQ ID N°l and ending at 
its 3' end with the nucleotide in position 826 of SEQ ID N°l coding for the LHP 
polypeptide : 

ATGGCAGAGATGAAGACCGATGCCGCTACCCTCGGGC 

AGGAGGCAGGTAATTTCGAGCGGATCTCCGGCGACCTGAAAACCCAGATCG 
ACC AGGTGGAGTCGACGGC AGGTTCGTTG 

CAGGGCCAGTGGCGCGGCGCGGCGGGGACGGCCGCCCAGGCCGCGGTGGT 
GCGCTTCCAAGAAGCAGCCAATAAGCAGAA 

GCAGGAACTCGACGAGATCTCGACGAATATTCGTCAGGCCGGCGTCCAATA 

CTCGAGGGCCGACGAGGAGCAGCAGCAGG 

CGCTGTCCTCGCAAATGGGCTTCTG 

e) a polynucleotide comprising at least 12 consecutive nucleotides of a polynucleotide 
chosen among the group consisting of SEQ ID N°2, SEQ ID N°3 or SEQ ID N°4; 

f) A polynucleotide having a sequence fully complementary to a polynucleotide 
choosen among the group consisting of SEQ ID N°2, SEQ ID N°3 or SEQ ID N°4; 

g) a polynucleotide hybridizing under stringent hybridization conditions with 
polynucleotide choosen among the group consisting of SEQ ID N°2, SEQ ID N°3 or 
SEQ ID N°4. 

2. A polynucleotide according to claim 1 wherein said polynucleotide codes for an 
antigenic protein from Mycobacterium tuberculosis comprising the following amino 
acid sequence of SEQ ID N°4 : 

MAEMKTDAATLGQEAGNFERISGDLKTQIDQVESTAGSLQGQWRGAAGTAAQ 
AAWRFQEAANKQKQELDEISTNIRQAGVQYSRADEEQQQALSSQMGF 

3. A polynucleotide according to claim 1 which is labeled with a marker compound. 

4. A purified polynucleotide comprising : 

a) a polynucleotide of sequence SEQ ID N°2 or a bilogically active polynucleotide 
derivative of SEQ ID N°2; and 

b) a polynucleotide coding for a polypeptide. 

5. A purified polynucleotide comprising : 

a) a polynucleotide of sequence of SEQ ID N°3 or a biologically active 
oolvnucleoride derivative of SEO ID N°2: and 
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b) a polynucleotide coding for a polypeptide. 

6. A recombinant vector containing a polynucleotide according to anyone of claims 1 
to 5. 

7. The recombinant vector according to claim 6, which is plasmid pIPX61 that has 
been deposited at the CNCM on May, 14 1996 under the accession number 1-1705. 

8. The recombinant vector according to claim 6, which is plamid pIPX30 that has been 
deposited at the CNCM on February 13, 1997 under the accession number 1-1845. 

9. A recombinant cell host containing a purified polynucleotide according to anyone of 
claims 1 to 5 or a recombinant vector according to anyone of claims 6 to 8. 

10. The recombinant cell host according to claim 9 which is a mycobacterium cell host 
belonging to the Mycobacterium tuberculosis complex. 

11. The recombinant cell host according to claim 10 which is Mycobacterium 
tuberculosis. 

12. The recombinant cell host according to claim 10 which is Mycobacterium bovis- 
BCG. 

13. The recombinant cell host according to claim 9 which is the E. coli strain deposited 
at the CNCM on May 14, 1996 under the accession number 1-1705. 

14. The recombinant cell host according to claim 9 which is the E. coli strain deposited 
at the CNCM on February 13, 1997 under the accession number 1-1845. 

15. A recombinant cell host containing a polynucleotide of SEQ ID N°2 or a 
recombinant vector carrying SEQ ID N°2 which is Mycobacterium smegmatis.. 

16. A purified polypeptide expressed by a recombinant cell host according to anyone 
of claims 9 to 13 and 15. 
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17. A purified polypeptide of claim 16 which id choosen from the goup of 
polypeptides consisting in : 

a) a polypeptide which comprises the following aminoacid sequence of SEQ ID N°5 : 
MAEMKTDAATLGQEAGNFERISGDLKTQIDQVESTAGSLQGQWRGAAGTAAQ 
AAVVRFQEAANKQKQELDEISTNIRQAGVQYSRADEEQQQALSSQMGF; 

b) a polypeptide comprising : 

i) amino acid in position 1 to amino acid in position 48 of SEQ ID N°5; or 

ii) amino acid in position 60 to amino acid in position 100 of SEQ ID N°5; 

c) a polypeptide comprising at least one antigenic portion of a polypeptide a) or b). 

18. An oligomeric polypeptide comprising at least two units of a polypeptide according 
to claim 17. 

19. The oligomeric polypeptide of claim 18 comprising up to 10 units of a polypeptide 
according to claim 17. 

20. A purified polypeptide comprising at least one antigenic portion of a polypeptide 
according to claim 17. 

21. The purified polypeptide according to claim 18 wherein the antigenic portion of 
the polypeptide of sequence SEQ ID N°4 is choosen among the group consisting in the 
following antigenic portions : 

a) SEQ ID N° 6 : NH2-MAEMKTDAATLGQEAGNFERISGDLKTQIDQVESTAGS 
LQGQWRGAAGT-COOH; 

b) SEQ ID N°7 : NH2-QEAANKQKQELDEISTNIRQAGVQYSRADEEQQQ 
ALSSQMGF-COOH; 

c) SEQ ID N°8 : NH2-QEAGNFERISGDLKTQIDQV-COOH; 

d) SEQ ID N°9 : NH2-GDLKTQIDQVESTAGS-COOH; 

e) SEQ ID N° 10 : NH2-GSLQGQWRGAAGTAAA-COOH; 

f) SEQ ID N°l 1 : NH2-QEAANKQKQELDEIST-COOH; 

g) SEQ ID N°12 : NH2-STNIRQAGVQYSRADEEQQQALSSQMGF-COOH; 

h) SEQ ID N°13 : NH2-RADEEQQQ ALSSQMGF-COOH. 

22. The purified polypeptide according to anyone of claims 20 and 21 comprising from 
2 to 10 antigenic portions of the polypeptide of SEQ ID N°4. 
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23. A purified polypeptide or an oligomeric polypeptide according to anyone of claims 
16 to 23 which is under the form of a MAP construct. 

24. A purified polypeptide or an oligomeric polypeptide according to anyone of claims 
16 to 23 which comprises an additional T-epitope. 

25. A monoclonal or a polyclonal antibody directed specifically against a purified 
polypeptide or an oligomeric polypeptide according to anyone of claims 16 to 24. 

26. An immunogenic composition comprising a purified polypeptide or an oligomeric 
polypeptide according to anyone of claims 16 to 24. 

27. A vaccine composition comprising a purified polypeptide or an oligomeric 
polypeptide according to anyone of claims 16 to 24. 

28. The vaccine composition according to claim 27 wherein said vaccine composition 
comprises additionally an antigenic protein from Mycobacterium tuberculosis or an 
antigenic portion of an antigenic protein from Mycobacterium tuberculosis. 

29. The vaccine composition according to claim 28 wherein said vaccine composition 
comprises additionally the ESAT-6 antigenic protein or an antigenic portion of the 
ESAT-6 protein. 

30. A diagnostic method for detecting the presence of a Mycobacterium tuberculosis 
bacterium in a biological sample, said diagnostic method comprising the steps of : 

a) bringing into contact the biological sample expected to contain a given 
pathogenic microorganism with a purified monoclonal or polyclonal antibody 
according to claim 25; 

b) detecting the antigen-antibody complexes formed; 

31. A diagnostic method for detecting the presence of a Mycobacterium tuberculosis 
bacterium in the serum of an infected patient, said diagnostic method comprising the 
steps of ; 

a) bringing into contact the serum sample expected to contain a given pathogenic 
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b) detecting the antigen-antibody complexes formed; 

32. A diagnostic kit for the in vitro diagnosis of an infection by Mycobacterium 
tuberculosis, comprising the following elements : 

a) A purified preparation of a monoclonal or a polyclonal antibody according to 
claim 25; 

b) Suitable reagents allowing the detection of the antigen/antibody complexes 
formed , these reagents preferably carrying a label compound, or being recognized 
themselves by a labeled reagent. 

c) optionally, a reference biological sample containing the Mycobacterium 
tuberculosis antigen recognized by the purified monoclonal or polyclonal antibody 
(positive control); 

d) optionally, a reference biological sample that does not contains the 
Mycobacterium tuberculosis antigen recognized by the purified monoclonal or 
polyclonal antibody (negative control). 

33. A diagnostic kit for the in vitro diagnosis of an infection by Mycobacterium 
tuberculosis, comprising the following elements : 

a) A purified preparation of a purified polypeptide or an oligomeric polypeptide 
according to anyone of claims 16 to 24; 

b) Suitable reagents allowing the detection of the antigen/antibody complexes 
formed , these reagents preferably carrying a label compound, or being recognized 
themselves by a labeled reagent. 

c) optionally, a reference biological sample containing a polyclonal or monoclonal 
antibody recognizing the purified polypeptide or the olgomeric polypeptide of step 
a) (positive control); 

d) optionally, a reference biological sample that does not contain a polyclonal or 
monoclonal antibody recognizing the purified polypeptide or the olgomeric 
polypeptide of step a) (negative control); 



34. A method for detecting Mycobacterium tuberculosis in a biological sample 
comprising the steps of : 

a) bringing into contact a purified polynucleotide according to anyone of claims 1 to 3 
with a biological sample. 
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b) detecting the hybrid nucleic acid molecule formed between said purified 
polynucleotide and the nucleic acid molecules contained within the biological sample. 

35. The method of claim 34, wherein before step a), the nucleic acid molecules of the 
biological sample have been made available to a hybridization reaction. 

36. A method for detecting a bacterium belonging to the Mycobacterium tuberculosis 
complex or to Mycobacterium bovis in a biological sample comprising the steps of : 

a) Bringing into contact a purified polynucleotide according to anyone of claims 1 to 3 
that has been immobilized onto a substrate with a biological sample. 

b) Bringing into contact the hybrid nucleic acid molecule formed between said purified 
polynucleotide and the nucleic acid contained in the biological sample with a labeled 
polynucleotide according to anyone of claims 1 to 3, provided that said polynucleotide 
and polynucleotide of step a) have non-overlapping sequences. 

37. The method of claim 36, wherein, before step a), the nucleic acid molecules of the 
biological sample have been made available to a hybridization reaction. 

38. The method of anyone of claims 36 or 37, wherein, before step b), the nucleic acid 
molecules that are not hybridized with the immobilized purified polynucleotide are 
removed. 

39. A method for detecting a bacterium belonging to the Mycobacterium tuberculosis 
complex in a biological sample comprising the steps of : 

a) Bringing into contact the nucleic acid molecules contained in the biological sample 
with a pair of purified polynucleotides according to anyone of claims 1 to 3. 

b) Amplifying said nucleic acid molecules; 

d) detecting the nucleic acid fragments that have been amplified, for example by gel 
electrophoresis or with a labeled polynucleotide according to anyone of claims 1 to 3. 

40. The method of claim 39, wherein before step a), the nucleic acid molecules of the 
biological sample have been made available to a hybridization reaction. 

41. A kit for detecting a bacterium belonging to the Mycobacterium tuberculosis 
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b) Reagents necessary to perform a nucleic acid hybridization reaction. 

42. A kit for detecting a bacterium belonging to the Mycobacterium tuberculosis 
complex or to Mycobacterium bovis in a biological sample comprising : 

a) A purified polynucleotide according to anyone of claims 1 to 3 that is immobilized 
onto a substrate. 

b) Reagents necessary to perform a nucleic acid hybridization reaction. 

c) A purified polynucleotide according to anyone of claims 1 to 3 which is 
radioactively or non-radioactively labeled, provided that said polynucleotide and the 
polynucleotide of step a) have non-overlapping sequences. 

43. A kit for detecting a bacterium belonging to the Mycobacterium tuberculosis 
complex or to Mycobacterium bovis in a biological sample comprising : 

a) A pair of purified oligonucleotides according to anyone of claims 1 to 3; 

b) Reagents necessary to perform a nucleic acid amplification reaction; 

c) Optionally, a purified polynucleotide according to anyone of claims useful as a 
probe. 

44. A recombinant vector according to claim 6, which is plasmid pEPX26 that has been 
deposited at the CNCM on May 14, 1996 under the accession number 1-1706. 

45. A recombinant vector according to claim 6, which is plasmid pPXl that has been 
deposited at the CNCM on May 14, 1996 under the accession number 1-1707. 

46. A recombinant cell host according to claim 9, which is the E. coli strain that has 
been deposited at the CNCM on May 14, 1996 under the accession number 1-1706. 

47. A recombinant cell host according to claim 9, which is the E. coli strain that has 
been deposited at the CNCM on May 14, 1996 under the accession number 1-1707. 

48. The vaccine composition according to claim 27 comprising a recombinant cell host 
containing a polynucleotide encoding a polypeptide according to claim 1 or a 
recombinant vector containing said polynucleotide. 
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49. The vaccine composition according to claim 48, wherein said polynulcotide or said 
vector encodes both the LHP or the ESAT-6 antigenic polypeptides or antigenic 
portion thereof. 

50. The vaccine composition according to claim 48 comprising a recombinant cell host 
expressing LHP and a recombinant cell host expressing ESAT-6. 

51. The vaccine composition according to anyone of claims 48 to 50, wherein the 
recombinant cell host is an eukaryotic cell host. 

52. The vaccine composition according to anyone of claims 48 to 50, wherein the 
recombinant cell host is a prokaryotic cell host. 

53. The vaccine composition according to claim 52, wherein the recombinant cell host 
is choosen from the group of bacteria consisting in : 

a) an attenuated bacterium belonging to the tuberculosis-complex; 

b) E. coli; 

c) a bacterium belonging to the Salmonella genus; 

d) a bacterium belonging to the Pseudomonas genus. 

54. A polynucleotide useful as a primer or a probe according to claim 1 which is 
choosen from the group consisting in : 

a) SEQ ID N° 14 : 5 '-CTGC AGCAGGTGACGTCGTTG- 3' 

b) SEQ ID N°15 : 5 '-CCGGGTGGCCGGGAAGTCTGTGT-3 ' 

c) SEQ ID N°16 : 5'-ACTACTTTCTCTTTCTACCTTCC-3' 

55. A pair of oligonucleotide primers according to claim 54, which is choosen from the 
group consisting in : 

a) SEQ ID N° 14 and SEQ ID N°15; 

b) SEQ ID N°14 and SEQ ID N°16. 
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Abstract of the disclosure 

The present invention is directed to a polynucleotide carrying an open reading 
frame coding for an antigenic polypeptide from Mycobacterium tuberculosis, named 
5 Ihp, which is placed under the control of its own regulation signals which are 
functional in mycobacteria, specially in mycobacteria belonging to the Mycobacterium 
tuberculosis complex and also in fast growing mycobacteria such as Mycobacterium 
smegmatis. The invention is also directed to the polypeptide LHP encoded by Ihp and 
most preferably to suitable antigenic portions of LHP as well as to oligomeric 
10 polypeptides containing more than one unit of LHP or an antigenic portion of LHP. 
The invention concerns also immunogenic and vaccine compositions containing a 
polypeptide or an oligomeric polypeptide such as defined above, as well as antibodies 
directed specifically against such polypeptides that are useful as diagnostic reagents. In 
m another embodiment, the present invention is directed to a polynucleotide carrying the 
ti; natural regulation signals of Ihp which is useful in order to express heterologous 
wl proteins in mycobacteria. Finally, the present invention is directed to oligonucleotides 
K H comprising at least 12 consecutive nucleotides from the regulation sequence of Ihp 
[ij which are useful as reagents for detecting the presence of Mycobacterium tuberculosis 
in a biological sample. 
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FOR: A POLYNUCLEOTIDE FUNCTIONALLY CODING FOR THE LHP PROTEIN 
FROM MYCOBACTERIUM TUBERCULOSIS, ITS BIOLOGICALLY ACTTVB 
DERIVATIVE FRAGMENTS, AS WELL AS METHODS USING THE SAME 

powto to mspRirr axpmact<>ctfies 

ASSISTANT COMMISSIONER FOR PATENTS 
WASHINGTON, D.C 20231 

SIR: 

Undersigned counsel hereby grants Sandra Gothard and/or Cynihia Socgren 

representatives of Capitol Patent Services, Inc. . on behalf of Obion, Sptvak. McClelland. 

Maier & Neustadt, P.C. , Steven B. Kelber, patent counsel, the power to inspect and obtain 

two (2) certified copies of U.S. Patent Application Application Serial No. 60/052,631. 

RcspectfMlly Wtanlned. 

OBLON, SHVAK, McCLELLAND, 
MAIER & NEUSTADT, P.C. 




Steven B. Kelber 
Registration No. 30,073 
Attorney of Record 



Fourth Floor 

1755 Jcffcrson D*vis Highway 
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